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Preface 



The papers in this volume were presented at the Seventh Annual International 
Computing and Combinatorics Conference (COCOON 2001), held August 20-23, 
2001, in Guilin, China. The topics cover most aspects of theoretical computer 
science and combinatorics related to computing. The conference was held in 
cooperation with the Chinese Academy of Sciences and China Computer Feder- 
ation. 

Submissions to the conference this year were conducted entirely online. A 
total of 97 papers were submitted in time to be considered, of which 50 regular 
papers and 16 short papers were selected by an international program committee 
consisting of Eric Allender, Bernard Chazelle, Danny Chen, Jianer Chen, Francis 
Chin, Kyung-Yong Chwa, Rod Downey, Erich Gradel, Yuri Gurevich, Steven 
Homer, Toshihide Ibaraki, Tao Jiang, Ker-I Ko, D.T. Lee, Xuemin Lin, Maurice 
Nivat, R. Ravi, Rudiger Reischuk, Seinosuke Toda, Jie Wang, Lusheng Wang, 
Guoliang Xue, and Mihalis Yannakakis. 

The authors of submitted papers come from the following countries and re- 
gions: Australia, Austria, Bangladesh, Canada, China (including Hong Kong and 
Taiwan), Czech Republic, France, Germany, India, Israel, Italy, Japan, Korea, 
New Zealand, The Netherlands, Poland, Russia, Singapore, Spain, Switzerland, 
U.K., and U.S.A. Each paper was given to at least three Program Committee 
members, who in some cases were assisted by subreferees. In addition to the se- 
lected papers, the conference also included two invited presentations by Bernard 
Chazelle and Avi Wigderson. 

To promote young researchers, the Hao Wang Award this year was given 
to a paper selected from papers written solely by authors who, at the time of 
submission, were either students or had received their doctoral degrees within the 
previous five years. I am happy to announce that the recipient of this award was 
Xiang-Yang Li for his paper “Generating Well-Shaped d-Dimensional Delaunay 
Meshes” . 

I would like to thank the program committee co-chair Yuri Gurevich for his 
strong support and all program committee members, their support staff, and 
subreferees for their excellent work within demanding time constraints. I would 
also like to thank all authors for submitting their papers to the conference. I 
am grateful to Steve Tate for letting me use the ACM SIGACT electronic sub- 
mission service, Minghui Li for helping me create conference web pages, and 
Richard Cheek for providing system support. Finally, I would like to express my 
gratitude to Ding-Zhu Du, Xudong Hu, and all local organizers for their hard 
work in making this meeting possible and enjoyable. 
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Jie Wang 
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Complete Problems for Valiant’s Class 
of qp-Computable Families of Polynomials 



Markus Blaser 

Institut fiir Theoretische Informatik, Med. Universitat zu Liibeck 
Wallstr. 40, 23560 Liibeck, Germany 
blaeserStcs .mu-luebeck. de 



Abstract. We prove that the families matrix powering, iterated ma- 
trix product, and adjoint matrix are VQP-complete, where VQP denotes 
Valiant’s class of quasipolynomial-computable families of multivariate 
polynomials. This proves a conjecture by Biirgisser [3, Conjecture 8.1]. 



1 Introduction 

In connection with his famous #P-completeness result [11] for the permanent, 
Valiant presented an algebraic analogue of the theory of N P-completeness [10] (a 
decade before the proposal of Blum, Shub, and Smale [2]). In the present work, 
we prove the completeness of various families of multivariate polynomials for 
Valiant’s algebraic class of qp-computable families of multivariate polynomials. 
For the reader’s convenience, we first present the relevant details of Valiant’s 
theory. More material can be found in [4,3]. 

A function t : N — >■ N is called p-bounded if t{n) < 0{n'^) for some constant c. 
The function t is called p~bounded from above and below if in addition t(n) > 
l7(n^/°). A function m : N — >■ N is called qp-bounded if u{n) < 

The main objects in Valiant’s algebraic N P-completeness theory are certain 
families of multivariate polynomials. Throughout the remainder of this work, k 
denotes a field. 

Definition 1. A sequence f = (/„) of multivariate polynomials over k is called 
a p-family if the number of variables and the degree of f„ are both p-bounded 
functions in n. 

The complexity L{f) of a multivariate polynomial / G k[Xi, . . . ,Xm] is the 
size of a smallest arithmetic circuit over the basis {+,—,*} (all operations with 
fanin two) that computes / from the indeterminates Xi , . . . , and the scalars 
in k. 

Definition 2. 1. A p-family f = (fn) is p-computable if the complexity L{fn) 

is a p-bounded function in n. The class of all p-computable families is de- 
noted by yP (or yPk). 

2. The p-family f is qp-computable if L{fn) is a qp-bounded function in n. 
The class of all qp-computable families is denoted by VQP (or VQP^j. 
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In the same way, we can also define the class VNP, an algebraic analogue of 
NP. Since a detailed definition of VNP is more complicated than those of the 
classes VP and VQP, the reader is referred to [4,3]. The following hypotheses 
play a central role in Valiant’s theory — like Cook’s Hypothesis in structural 
complexity theory. 

Hypothesis 1. 1. Valiant’s Hypothesis: VP C VNP over any field. 

2. Extended Valiant’s Hypothesis: VNP \ VQP yf 0 over any field. 

In Valiant’s model, families of polynomials are compared by a simple type of 
reductions, namely projections. A polynomial f{Xi, . . . ,Xm) is called a projec- 
tion of another polynomial g{Yi , . . . , y„) if there are Si, . . . , s„GfcU{Ai, . . . , X^} 
such that f{Xi , . . . , X^) = g{si , . . . , s„). In other words, / is obtained from g by 
replacing each of the Yi, . . . , by a scalar from k or by one of the Xi,. . . , X^. 

Definition 3. 1. A family f = (/„) is a p-projection of g = (gm), in symbols 

f g, if there exists a function r that is p-bounded from above and below 
such that 

3no Vn > no : fn is a projection of gr{n) ■ 

2. A family f = (/„) is a qp-projection of g = (gm), in symbols f <qp g, if 
there exists a qp-bounded function r that is also p-bounded from below such 
that 

3no Vn > no : fn is a projection of gr(n) ■ 

The relations “is a p-projection of” and “is a qp-projection of” are both 
transitive. Obviously, if g <p /, then also g <qp /. 

The next step is to define VQP-complete families. 

Definition 4. A qp-computable family f is called yQP -complete if g <qp / for 
all g G VQP. 

As expected, if / is VQP-complete and / <qp h, then h is also VQP-complete. 
Up to now, the only known VQP-complete family is the determinant family 
defined by 

DETn = det{Xij)i<ij<n ■ 

(For a proof, see [3, Corollary 2.29].) As our main results, we exhibit further 
VQP-complete families of multivariate polynomials. Specifically, we show that 
the families iterated matrix multiplication IMM , matrix powering POW, and 
adjoint matrix ADJ defined by 

IMMn = trace(Ai • • • A„), 

POIT„ = trace(AL"/3J), 

ADJn = trace(det(A) • X~^) 

are all VQP-complete. (Above, X and the X^ denote n x n-matrices with inde- 
pendent indeterminate entries.) This proves a conjecture by Biirgisser [3, Conjec- 
ture 8.1]. The original conjecture for POW made by Biirgisser is that (trace(A”)) 
is VQP-complete. This remains an open problem. 
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The main technical problems we are confronted with are that we have a very 
weak type of reductions, namely qp-projections, and that we only compute one 
result. 

For any constant c G N, let the family constant size c iterated matrix multi- 
plication c-CIMM be defined through 

c-CIMMn = trace(Fi • • • F„) , 

where the YJy are c x c-matrices (and not n x n-matrices) with independent inde- 
terminate entries. Strengthening the above VQP-completeness result for IMM , 
we also show that for any c > 3, the family <^CIMM is VQP-complete. This 
follows from the results by Ben-Or and Cleve [1]. 

2 Auxiliary Results 

We briefly review some (well known) results, which we will need later on, and 
provide some useful variations of these auxiliaries. For a multivariate polyno- 
mial / G k[Xi^ . . . , Xm] the depth D{f) is the minimal depth of an arithmetic 
circuit over the basis {-P, — , *} (all operations with fanin two) that computes / 
from the indeterminates X\, . . . , X^, and the scalars in k. 

Theorem 1 (Hyafil, Valiant et al.). If f is an n-variate polynomial of degree 
d> 1, then 

D{f) < 0(log(d- L(/))logd-Plogn). 

Moreover, there is an arithmetic circuit of depth 0{log{d • L{f))logd -P logn) 
and size d^L{f)^ that computes f. 

For a proof of the first part of the statement, see [7]. It can also be shown 
that the constant hidden in the O-notation is universal. The second claim is 
due to Valiant et al. [9]. The reader is also referred to [4, Thm. 21.35]. Miller et 
al. [8] present an online construction. 

For technical reasons, we introduce the family c-CIMM' defined through 

c-CIMM'^ = the entry in position (1, 1) of Yi • • • V„, 

where the W are c x c-matrices with independent indeterminate entries. The 
family 3-CIMM' plays an important role for our completeness results. 

Theorem 2 (Ben-Or &: Cleve). Let f be a multivariate polynomial. More- 
over, let I = Then f is a projection of ii-CIMM'^. 

For a proof, see [1, Thm. 1]. Since Ben-Or and Cleve focused on computing 
polynomial size formulas with polynomial length straight-line programs that use 
a constant number of registers, they only stated the above theorem for formulas. 
Nevertheless, Ben-Or and Cleves approach also works for arbitrary straight-line 
programs, since a straight-line program of depth d induces a formula of depth d 
and size at most 2 '^. 
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Since 



(1, l)-entry of Yi • • • 1"„ = trace(£;i,i • Yi • • • Fn • £^ni), 



where ifi.i denotes the c x c-matrix that has a one in position (1, 1) and ze- 
ros elsewhere, c-CIMM'^ is a projection of c-CIMMn+ 2 - Therefore, the below 
theorem follows. 

Theorem 3. Let f be a multivariate polynomial and let I = + 2. Then f 

is a projection of Z-CIMM^. 

3 Completeness Results 

In this section, we prove as our main result the VQP -completeness of the families 
IMM , POW, and ADJ . Since each of these three families is in VP, it suffices to 
show the hardness. 



3.1 Iterated Matrix Multiplication 

Theorem 4. For any c > 3, c-CIMM is \/QP~complete (under qp-projections). 
The same is true for c-CIMM' . 

Proof. The polynomial c-CIMM n can be computed by an arithmetic circuit of 
size 0{n). Thus c-CIMM G VP C VQP. 

Since obviously c-CIMMn is a projection of c'-CIMM n for each n and each 
c' > c, it is sufficient to show that for any / = (/„) G VQP, / <qp 3-CIMM. 
By definition, we know that the number of variables and the degree of /„ are 
p-bounded functions and that T(/n) is a qp-bounded function. By Theorem 1, 
D{fn) < 0(log"* n) for some constant m. By Theorem 3, /„ is a projection of 
H-CIMMi where £ = Hence / is a qp-projection of 3-CIMM . 

The result for c-CIMM' follows completely alike. □ 

Since 3-CIMM ^ is a projection of IMM^, we immediately obtain the follow- 
ing corollary. 

Corollary 1. The family IMM is \/QP-complete (under qp-projections). 

The next corollary is of rather technical nature. It describes a “self reducibil- 
ity” property of c-CIMM . We will rely on the corollary in the subsequent sec- 
tions. 

Corollary 2. Let c > 3 and let i and j he functions N — >■ {l,2,...,c}. Fur- 
thermore, let (a„) and (/?„) he sequences of scalars from k and define w(ji) = 
min{f > n I Oi yf 0}. If w is qp-bounded, then {a„ ■ f„ /3„) is \/QP -complete, 
where /„ is the entry in position (i{n),j{n)) of a generic iterated 3 x 3 -matrix 
product of length n. 
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Proof. We first prove that {c-CIMM'^ + fin) is VQP-complete. Then we reduce 
{c-CIMM'n + fin) to (a„ • /„ + /3„). 

For the first part, let ((/„) G VQP be given. Then also ((/„ — fin) G VQP. By 
the VQP-completeness of c-CIMM', {g„ — fin) is a qp-projection of c-CIMM' . 
Thus {gn) is a qp-projection of {c-CIMM'.^ + fin). Since ((/„) was arbitrary, this 
shows the VQP-completeness of {c-CIMM'.^ + fin). 

Now we reduce {c-CIMM'n + fin) to (a„ • /„ + fin). To obtain c-CIMM'n + fin, 
we substitute w{n + 3) — n — 3 many copies of the c x c-identity matrix and one 
’^w{n+3) multiple of the c x c-identity matrix into a^j^n+s) ■ fw(n+3) + Pw{n+3) 
at arbitrary places except the first and the last matrix. (Note that w{v) >v for 
all V.) As the first and the last matrix, we substitute appropriate permutation 
matrices that move the entry in position {i{n),j{n)) into position (1,1). Since 
w is qp-bounded, {c-CIMM'n + fin) is a qp-projection of («„ • /„ + fin)- □ 



3.2 Matrix Powering 



Our plan is to modify the reduction to iterated matrix multiplication given by 
Cook [5]. The weak type of reductions however is a major obstacle. 

Let X\ , ... , Xm be 3 X 3-matrices with independent indeterminate entries. 
Let Y be the block matrix of size 3m x 3m 



/ 0 Ai 0 ••• 

0 0 A2 



Y = 



0 



0 0 0 Xn,_^ 

\Xm 0 0 ••• 0 y 



(Note the extra Xm in the lower left corner. Without it, all powers of Y would 
have trace zero.) A straight forward calculation shows that 



/Pi 0 0 ••• 0 \ 

0 P2 0 0 



0 0 0 0 
Vo 0 0 ••• PmJ 

where P^ = • • • XmXi ■ ■ ■ A^_i. By the below fact, 

trace(y’”) = m • trace(Ai • • • Xm) = m • "i-CIMMm- 



( 1 ) 



Fact 1. 1. For any two I x I-matrices A and B, trace(AP) = trace(PA). 

2. If we write A = (Aj j)i<i j<g as a block matrix with {i/q) x {i/q)-hlocks Ai j, 
then trace A = trace Ai,i + trace ^ 2,2 + b trace Aq^q. 
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Thus m ■ 3-CIMMjn is a projection of POW^m- If the characteristic of k 
is zero, then m ■ 3-CIMM m is always nonzero. If the characteristic p > 0, 
then at least every second of the m ■ ?>-CIMM m is nonzero. Thus the function 
j I— >■ min{i > j \ i ^ 0 (p)} is qp-bounded. Hence by Corollary 2, the family 
(to • ‘i-CIMMm) is VQP-complete and so is POW. 

We define the family m-POW for a given function to : N — >■ N by 

TO-POW„ = trace(X™("i), 

where X denotes a matrix with independent indeterminate entries of size n x n. 

For functions to such that m{n) < [n/3j for all n, we can embed the 
above [n/3j -powering problem into the TO(n)-powering problem by placing the 
2>m{n) X 3TO-(n)-matrix X together with an (n — 2>m(n)) x (n — 3m(n))-zero 
matrix on the diagonal: 




Thus trace(Z'"(”i) = trace(X™(")) = POIF 3 ^(„) is a projection of m-POWn- 
Therefore, the below theorem follows (by applying the previous construction to 
P 0 IF 3 ^(„), with possibly some of the X^ substituted by identity matrices). 

Theorem 5. For any function m such that m{n) < [n/3j for all n and m is p- 
bounded from below, the family m-POW is yQP -complete (under qp-projections). 



3.3 Matrix Inversion 

Let X\, , Xn be 3 X 3-matrices with indeterminate and independent entries. 
Let N be the 3(n + 1) x 3(n + l)-matrix 



/OXi 0 ■■■ 0 \ 

0 0 ^2 0 



N = 



0 0 0 ■ ■ . 

\0 0 0 ••• 0 / 



The matrix X is nilpotent and we have 



(/ - N)-^ = / + iV + 7V2 + -- - + iV” = 



/hPl.l Pl,2-- - Pl.n\ 

0 I P 2.2 P2,n 



0 0 0 ■ ■ . P„ „ 

Vo 0 0 ••• / / 



(2) 



where / denotes the identity matrix of appropriate size (3n + 3 on the left-hand 
and 3 on the right-hand side) and Pij stands for the product Xi - ■ ■ Xj for i < j. 
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Let S be the permutation matrix 

/O / 0 ••• 0 
0 0/ 0 

S= : : 

0 0 0 ■■ I 

\I0 0 ••• 0 

We have 

( Pi,n I Pi,i • ■ ■ Pl,n-1 ^ 

P2,n 0 / P2,n-2 

{S-'^il-N))-^ = {I-N)-^S= : : . (3) 

Pn,u 0 0 ■ ■ . I 

V / 0 0 ••• 0 / 

Thus 

trace(S'“^(/ — N))~^ = trace = 3-CIMM„. 

Because det(S'“^(/ — N)) = ±1, ±3-CIMMn is a projection of ADJ^n+i)- 
(Since S' is a permutation matrix, S~^{I — N) is a projection of the generic 
3(n + 1) X 3(n + l)-matrix X.) This yields the following completeness result. 

Theorem 6. The family ADJ is \/QP-complete (under qp-projections). 

4 Further Directions 

What happens if we do not take the trace in the definition of the families IMM , 
POW, and ADJ , but look at the following families given by 

IMM'^ = (1, l)-entry of 
POW'^ = (1, l)-entry of XL"/3J , 

ADJ'^ = (1, l)-entry of det(/f) • X~^ ? 

From Theorem 4, (1), and (3), respectively, it follows that these families are also 
VQP-complete. Thus we have the following theorem. 

Theorem 7. The above families IMM' , POW' , and ADJ' are yQP -complete 
(under qp-projections). 

Remark 1. By applying appropriate permutation matrices, we could also define 
IMM' and ADj' not by taking the (1, l)-entry but any other entry and would 
still get the same completeness results for these families. Since Valiant’s model 
is nonuniform, the entries may also vary throughout the whole sequence. 
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Can we achieve a similar result for the family POW'l In fact, we can. The 
proof is however a little more elaborate. 

Theorem 8. For any functions i,j : N ^ N with 1 < i{n),j{n) < n for all 
n, the family {{i{n),j{n)) -entry o/xL”/^J) is a \/QP-complete family (under 
qp-projections). 

Proof. Let w.l.o.g. n = 3m. We distinguish the cases i(ji) = j{n) and i{n) yf j{n). 
In the first case, note that in (I), each entry on the diagonal corresponds to an 
entry of an iterated product of 3 x 3-matrices of length m. 

For the case i{n) yf j{n), note that also the entry in position (1, 2) of T" in (1) 
is an entry of an iterated 3 x 3-matrix product. Let P be the permutation matrix 
that exchanges the first with the t(n)th and the second with the j(n)th row resp. 
column. The matrix P is its own inverse. We now consider PYP instead of Y in 
(1). We have (PYP)"^ = PY^P. By the choice of P, the entry in position (i, j) 
of PY^P is an entry of an iterated 3 x 3-matrix product. 

Altogether, we obtain a VQP-complete family. □ 

Over fields of characteristic zero, we can even prove the VQ P-completeness 
of the family ((1, l)-entry of A") corresponding to the n-powering problem, a 
task which remains unsolved in the case of taking the trace. 

Theorem 9. Let k be a field of characteristic zero. For any t, j : N — >■ N with 
1 < i{n),j{n) < n for all n and any polynomial p, {{i{n), j{n)) -entry o/ 
is a yQP -complete family (under qp-projections). 

Proof. Since p is a polynomial, / is in VP C VQP. 

It remains to show the hardness: let Ai , . . . , A„ be 3 x 3-matrices and define 
N like in (2). The matrix N is nilpotent, specifically, = 0. We have 







V 


=0 ^ ^ 






//p(n).Pi.i ("(;)) • Pi.2 • • 


U-lJ 




0 I p{n) • P 2.2 




0 0 O’ 


' • P{n) ■ Pn,n 




0 

0 

0 


I / 



where / denotes the identity matrix of appropriate size and Pij stands for the 
product Xi - ■ ■ Xj. 

If i{n) yf j(n), then we can move the entry in position (I,n) to position 
(i{n),j{n)) by switching over from (/ + N) to P{I + N)P for an appropriate 
permutation matrix (like in the proof of Theorem 8). After that, the entry in 
position {i{n),j{n)) is the (^^^^)-multiple of an entry of an iterated 3 x 3-matrix 
product. 
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It remains the case i{n) = j{n). We may assume that i{n) = j{n) = 1. 
Otherwise, we switch over to P{I + N)P where P is the permutation matrix 
that exchanges the first and the i(n)th row resp. column. Let T be the matrix 



//O 0 ••• 0\ 
0/0 0 



T = 



0 0 0 ■■•0 

\/0 0 ■■■ I j 



The inverse of T coincides with T except that the identity matrix in the lower 
left corner has to be replaced by —I. A straight forward calculation shows 
that the 3 x 3-matrix in the upper left block of 

equals • Pi,n + /• Thus the entry in position (1, 1) of is just 

(pO)) . 3-CIMM'^ + 1. 

Since the characteristic of k is zero, the coefficient yf 0. Together with 

Corollary 2 this proves the claim of the theorem. □ 



Once we have proven the completeness of IMM , POW, and ADJ, many other 
families of polynomials corresponding to problems from linear algebra turn out 
to be VQP-complete. 

As an example, we examine the family LINEQ 

LINEQ^ = first entry of det(A) • X~^b, 



which corresponds to solving a system of linear equations Xa = b. Above, X 
denotes an n x n-matrix and b a (column) vector of length n with independent 
indeterminate entries. 

If we specialize b\^ 1 and b^ ^ Q iox 2 < v < n, then the image of LINEQ^^ 
under this specialization is ADJ'^. Thus, the family LINEQ is VQP-complete. 

Theorem 10. The family LINEQ is \/QP-complete (under qp-projections). 

Since the permanent family PER is VNP-complete over fields of characteristic 
other than two, the Extended Valiant’s Hypothesis is equivalent to the following 
completely algebraic statements by our VQ P-completeness results: the Extended 
Valiant’s Hypothesis holds iff PER is not a qp-projection of IMM nor of POW 
nor of ADJ nor of any other of the VQP-complete families considered in this 
work. 
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Abstract. The paper presents a simple construction of polynomial length 
universal traversal sequences for cycles. These universal traversal se- 
quences are log-space (even NC^) constructible and are of length 0(n^ °®). 
Our result improves the previously known upper-bound 0(n^'^®) for log- 
space constructible universal traversal sequences for cycles. 



1 Introduction 

One of the major problems in computer science is the graph s, t-connectivity 
problem. It is well known that a directed version of this problem is complete 
for nondeterministic log-space (NL), and its undirected version is complete for 
symmetric log-space (SL). It has been conjectured that symmetric log-space is 
equal to deterministic log-space {SL = L). 

One approach to proving that conjecture is to use universal traversal se- 
quences introduced by Cook (see [AKL-I-].) A traversal sequence for a d-regular 
graph is a sequence of numbers from {!,... ,d}, which directs us in traversing 
the graph. For given d and n, a universal traversal sequence for d-regular graphs 
of size n is a traversal sequence which completely traverses any d-regular graph 
of size n starting at any vertex. [AKL-I-] gave a probabilistic argument for ex- 
istence of universal traversal sequences of polynomial length in d and n. In a 
sequence of papers [N], [NSW], [ATWZ] it was shown using derandomization of 
that argument that SL C DSPACE{log^^^ n). 

A more direct approach to proving SL = L is to construct an explicit 
log-space constructible universal traversal sequence for d-regular graphs, where 
d > 3. Indeed, explicit constructions of universal traversal sequences for spe- 
cific classes of graphs exist. For 2-regular graphs (cycles), [BBK-I-], and [B] 
constructed universal traversal sequences of length which were later 

superseded by log-space constructible universal traversal sequences for cycles of 
length 0(n^-^®) by Istrail [I]. In this paper we present a simple log-space con- 
struction of universal traversal sequences for cycles of length 0(n^ °^) (which we 
believe can be improved up to 0(n^ ®®) by setting parameters of the construction 
optimally.) The best known lower bound I7(n^ ‘^^) and non-constructive upper 

* Supported in part by NSF grant CCR-9734918. 
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bound 0{n^) on the length of universal traversal sequences for cycles are due 
to [BT] and [A], respectively. (See [BT] for comprehensive overview. Recently, it 
was shown in [K2] that a different kind of traversal sequences does not obey these 
lower bounds.) Explicit universal traversal sequences of length for com- 

plete graph were presented in [KPS], and [HW] presented construction of length 
^O(diogd) d-regular expanders. The best construction of universal traversal 
sequences for 3-regular graphs is based on Nisan’s pseudo-random generator [N] ; 
these sequences are of length 

1.1 Overview of the Construction 

Our construction has two parts: 

1. The first part (Section 3) is a construction of an universal traversal 

sequence (UTS) for cycles of length n. This part uses essentially the same idea as 
a construction of Bridgland [B] . It is a recursive construction in which at every 
iteration a UTS for cycles of length cn, c > 2, is constructed from a UTS for 
cycles of length n. Every iteration consists of two dual stages. The depth of the 
recursion is O(logn) and the factor by which the generated universal traversal 
sequence is expanded at each stage is 0{n). 

2. The second part (Section 4) involves again two dual stages that reduce the 
expansion factor to 0(1), and they interleave the stages of the first part. 

At both stages of the first part the traversed graph is reduced to a smaller 
one, whereas at both stages of the second part the graph is expanded a little bit 
by inserting new vertices. After applying all four stages the graph is smaller by 
at least a constant factor. 

The main contribution of this paper lies in the construction of the second 
part where we show that we can modify the traversed graph by inserting new 
vertices so that this modification is transparent for traversal sequences. 

The previous construction by Istrail was also based on Bridgland’s construc- 
tion. Istrail’s construction also reduced the expansion factor to a constant as 
our does but using slightly different (ad hoc) approach. Our construction seems 
more natural and better motivated hence, much simpler to understand. 

2 Preliminaries 

Let G = (V,E) be an undirected cycle. Let V = {v\, . . . ,Vn} so that E = 
{{vi,Vi+i)] 1 < t < n} U {(r:i,u„)}. We may look at every edge (u,v) in G as 
a pair of two directed edges (u,v) and {v,u). For 1 < i < n, the directed edges 
(vi,Vi+i) and (vn,vi) are called the right edges, and {vi+i,Vi) and (vi,Vn) are 
called the left edges. Similarly, Vi+\ is called a right neighbor of Vi, and Vi is a left 
neighbor of Vi+\. At every vertex in G, let one of the outgoing edges be labeled 
by 1 and the other one by 0. We label every vertex in G according to the label 
of its right outgoing edge. 

A 0-1 sequence b{vi),b{v 2 ), ■ ■ ■ , b{vn) uniquely describes G, where b{vi) is the 
label of vertex Vi. We call such a sequence a graph sequence corresponding to G. 
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Two graph sequences are considered to be the same if one of them is a cyclic 
shift of the other one. Hence, we consider any graph sequence to be wrapped 
around (cyclic). We consider all the graphs with the same graph sequence to 
be the same. We identify every graph sequence with its corresponding graph 
(actually a class of the same graphs.) We refer to the digits of a graph sequence 
as vertices. (We use term vertex also for digits of any 0-1 sequence.) 

A traversal sequence is a 0-1 sequence. “To run a traversal sequence t on a 
graph G starting at a vertex v” means to walk in G starting at v and following 
edges labeled consistently with t. Note, if the label of a vertex in the traversal 
sequence matches the label of the traversed vertex then we move to the right 
neighbor, otherwise to the left one. Because we identify a graph with its graph 
sequence we may also talk about running a traversal sequence on a 0-1 sequence. 

Let s be a 0-1 sequence. A 0-run (1-run) in s is any maximal part of s 
consisting only of O’s (I’s). The type of the run, 0-run or 1-run, is called a color of 
that run. The leftmost and rightmost vertices of the run are called border vertices. 
The border vertices of a run of length one coincide. A 0-1 sequence consisting 
only of runs of length i and j is (i, j)-sequence e.g., 000 111 000 111111 000 is a 
(3, 6)-sequence. A sequence consisting of a single run is called monochromatic. 

A 0-1 sequence t completely traverses s starting at vertex v if during traversal 
of s by t starting at v we visit all the vertices of s. The 0-1 sequence t strongly 
traverses s starting at v if it completely traverses s starting at v and for every 
vertex there is a visit to that vertex which comes from left and continues to the 
right, or it comes from right and continues to the left i.e., the vertex is entered 
using one edge and left using the other one. A 0-1 sequence t is a (strong) 
universal traversal sequence for cycles of length n if for any cycle G of length n, 
t completely (strongly) traverses G starting at any vertex. 

Observe, any UTS for cycles of length n -I- 2 is a UTS for cycles of length n-l- 1 
and a strong UTS for cycles of length n. (Proofs of many claims are omitted in 
this extended abstract due to the space limitation; see [Kl] for proofs.) 

3 An Universal Traversal Sequence 

3.1 A Parity Contraction 

Consider running a 1-run t of length I on a 1-run r\ of length n < I starting at 
a left border vertex of ri. Assume that ri is followed by a 0-run tq. It is easy to 
see that if the parity of I and n are the same then we end up at the left border 
vertex of tq, else we end up at the right border vertex of ri. By symmetry, we 
get symmetric behavior for running a 0-run on a 0-run starting at the left border 
vertex, and for running a 1-run on a 0-run and a 0-run on a 1-run starting at 
the right border vertices. Hence, if the parity of the length of the traversed run 
corresponds to that of the traversal run we end up at the border vertex of the 
next run, otherwise we end up at the opposite border vertex of the same run. 

This motivates the following definition. Let s be a non-monochromatic graph 
sequence, and let s be the same as rori • • • r^_i, where every is a monochro- 
matic run and for every 0 < i < k, ri has an opposite color than modfe- 
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For 0 < i < fc, let r' be a run of the same color as Vi, and let the length of r' 
be one if is of odd length and two otherwise. Then a parity contraction of s is 
a sequence rgr^ • • • I.e., we replace every run of odd length by one vertex 

run, and every run of even length by two vertex run. 

There is a natural mapping of border vertices of original runs to border 
vertices of reduced runs. In the case of odd length runs, both border vertices are 
mapped to the same vertex. Observe that for traversing a parity contraction it 
suffices to consider (l,2)-sequences. 

Proposition 1. Let s be a non-monochromatic graph sequence of length I, and 
let S0 be its parity contraction. Let v be a border vertex in s and w® the corre- 
sponding border vertex in S0. Lf a traversal (l,2)-sequence ^0 strongly traverses 
S0 starting at U0 then t, obtained from t(^ by inflating every run by l-\-{l mod 2) 
vertices, strongly traverses s starting at v. 

3.2 A Pair Contraction 

Let s be a (l,2)-sequence. We call a 0-run (1-run) of length 2 in s a 00-pazr 
(ll-pair). A 0-run and a 1-run of length 1 are singletons and any maximal part 
of s consisting of singletons is 01-ran. (A 01-run does not necessary start by 0 
and end by 1.) Let us consider a graph sequence s containing a 01-run followed 
by a 11-pair and another 01-run: 01010 11 0101010101 . . . , and let us consider a 
traversal sequence t = 01010101010 11 01010101. . . . Starting t at the first vertex 
of s, we reach the left vertex vl of 11-pair after 5 digits of t. The next 1 in t 
brings us to the right vertex wr of the 11-pair in s, the following 0 in t takes us 
back to the vertex vi^, and so on. When 11-pair in t comes we get to the vertex 
following the 11-pair in s and we continue to run to the right. By symmetry, we 
observe similar effects for the other possible colorings of the pairs in s and t. 

Hence, if s is a graph (l,2)-sequence and we traverse s by a (l,2)-sequence 
t then whenever we reach a pair in s during that traversal we stay at that pair 
until a pair appears in t and if the color of the pairs in t and s is the same 
then after that pair in t, the traversal goes to the right neighbor of the pair in 
s otherwise to the left one. Even if several pairs are next to each other in s not 
separated by any 01-run, this behavior of the traversal can be observed on each 
of these pairs. 

This motivates the following construction. Let s be a (l,2)-sequence contain- 
ing at least one pair. A pair contraction Sc of s is a 0-1 sequence obtained from 
s by first removing all singletons, and then replacing every pair by one vertex of 
the same color e.g., 01010 11 010 11 01 00 10. . .is contracted to 110. . . . Note, 
the length of Sc is at most half of the length of s. 

Proposition 2. Let Sc be a pair contraction of a (l,2)-sequence s of length 1. 
Let t be a traversal sequence obtained from a strong traversal sequence tc of Sc 
starting at Vc by replacing every 1 by 011(01) , and every 0 by 001(01) . 

Then t is a strong traversal sequence for s starting at the left vertex of a pair 
corresponding to Vc, if Vc is colored 0, and at the right vertex of that pair other- 



wise. 
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3.3 Putting Them Together 

Propositions 1 and 2 give us a way to construct a universal traversal sequence of 
length . Given a strong universal traversal sequence for cycles of length 

n, we construct a strong UTS t 2 n for cycles of length 2n using the construction 
of Proposition 2 and then the construction of Proposition 1 as follows. 

We first replace every 1 in by 011(01)" and every 0 in by 001(01)" as in 
Proposition 2 to get Then we prepend (01)" in front of to get which 
is a (l,2)-sequence. Now, we expand every run in by 2n = 2n+ (2nmod2) 
vertices as in Proposition 1 to get Finally, we prepend P" in front of 
to get t 2 n- We call the prepended sequences (01)" and 1^" loaders. 

We claim that t 2 n is a strong UTS for cycles of length 2n. Let s be a graph se- 
quence of length 2n, let s® be its parity contraction (assuming s is not monochro- 
matic), and let Sc be the pair contraction of Sq (assuming S 0 contains at least 
one pair.) Clearly, |sc| < n, so strongly traverses Sc starting at any vertex. 
The reader may easily verify using Proposition 1, that strongly traverses S 0 
starting at any vertex, and hence using Proposition 2, t 2 n strongly traverses s 
starting at any vertex. (Loaders bring us to appropriate vertices so Propositions 
1 and 2 are applicable.) 

We need a strong UTS for cycles of length (let’s say) 5 to start this recursive 
construction of UTS’s. 

4 An 0{n^) Universal Traversal Seqnence 

A disadvantage of the previous construction is the super-polynomial length of 
the produced UTS. The reason for this length is a pessimism of the construction 
in two directions. The construction is pessimistic about the length of runs at 
each step, and about the number of rounds the construction has to be repeated. 

A possible way how to reduce the overall length of the resulting UTS is to 
protect us against appearance of long runs. The way how we achieve this is by 
introducing two new stages in the construction which break down long runs. One 
of these stages takes place before parity contraction and the other one before 
pair contraction. The former stage reduces the length of 0-runs and 1-runs to 
constant length, the latter one reduces the length of 01-runs to constant length. A 
consequence of that run length reduction is an expansion of a traversal sequence 
in Propositions 1 and 2 just by a constant factor. 

Both run breaking stages split long runs by inserting extra vertices in them. 
This actually increases the length of the graph sequence but in the consecutive 
contraction stages these increases are eliminated. The insertion is done in a way 
which ensures that any generated traversal sequence behaves on the new graph 
sequence and on the old one in the same way. 

4.1 1-Run and 0-Run Breaking 

This stage is aimed at breaking down long 0-runs and 1-runs. We are going to 
use the following simple proposition for m equal 3 and 5. 
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Proposition 3. Let m > 3 be an odd integer. Then for any I > m there is a k 
and even d such that 2 < d < 2m and I = mk + d. 

Let s be a graph sequence. We replace every run of length five and every run 
of length longer than six in s by another sequence to obtain a graph s' with all 
runs of short length. The replacement is done in the following way. Let r be a 
0-run or 1-run of length I = 3k + d G {5, 7, 8,9,.. .} in s, where k and d are from 
the previous proposition. If r is a 0-run then we replace it by r' = (000 01)^0*^ 
otherwise we replace it by r' = (111 10)^1'^. We call this operation 0-run and T 
run breaking (or simply 1-run breaking.) We refer to r as to the original sequence 
and to r' as to the stuffed sequence. 

Lemma 1. Let s' be obtained from a non-monochromatic graph sequence s by 
Trun breaking. Let v be a border vertex of some run in s and let v' be its cor- 
responding vertex in s'. Let (3,6)-sequence t strongly traverse s' starting at v' . 
Then t, starting at v, completely traverses s, and all vertices in s, except possibly 
v, are strongly traversed. 

This lemma is proven by focusing attention on behavior of (3,6)-sequences 
on original and stuffed runs. It can be shown that any (3,6)-sequence t behaves 
the same on any original run r and its corresponding stuffed sequence r' in the 
following sense. If we run t starting at leftmost (rightmost) input vertices v and 
v' of r and r', respectively, then t keeps us in r as long as it keeps us in r' , and it 
leaves r and r' at the same time on the corresponding output vertices, i.e., either 
at wi and or v^ and (Fig. 1.) While running t we observe our position only 
after every whole run in t. (Sequence t may temporarily leave r or r' during a 
run in t.) More formal treatment is given in the full version of this paper [Kl]. 



r ... 000 1 11 1 11 1 11 1 11 1 ... 1 11 11 11 000 .. . 

H t 

V, V V 

I r 

r' ...000 111 10 111 10 111 10 ... Oil 11 11 000 ... 

M —I 

V/ V ' a V' 

1 r 



Fig. 1. 



Let s and s' be as in Lemma 1. Let t be a traversal (3,6)-sequence which 
strongly traverses s' starting at any vertex. Let sequence t' be obtained from t 
by prepending a run w (a loader) of length |s| of opposite color than the first run 
of t. Then Lemma 1 implies that t' strongly traverses s starting at any vertex. 

We conclude that stuffing long runs of O’s and I’s is transparent for (3,6)- 
traversal sequences. The analysis is done explicitly for (3,6)-sequences but es- 
sentially identical analysis can be carried out for any (2i -|- l,4i -|- 2)-sequence, 
for z > 1, where we stuff long runs every (2z -|- 1) vertices. 

The following proposition describes behavior of (3,6)-sequences on short runs. 
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Proposition 4. 1. Any (3,6)-sequence started at left border vertices or right 
border vertices behaves the same on runs 11, 1111 and 111111, where the 
output vertices are vertices of the other color adjacent to these runs in graph 
sequences that contain these runs. The same holds for 00, 0000 and 000000. 

2. Any (3,6)-sequence started at left border vertices or right border vertices 
behaves the same on runs 1 and 111, where output vertices are as above. The 
same holds for 0 and 000. 

Let s' be a (1,2) -sequence and t' be a strong traversal sequence of s' starting 
at a vertex v'. We say that t' extra strongly traverses s' if every pair in s' is 
traversed by t' in such a way that it is entered from outside at one border vertex, 
walked back and forth on it and then left through the other border vertex. 

Lemma 2. Let s be a non-monochromatic graph sequence with every run of 
length from {1,2, 3,4,6}. Let s® be a parity contraction of s. Let v be a border 
vertex in s, and u® its corresponding vertex in s®. Let be an extra strong 
traversal (l,2)-sequence o/ s® starting at v®. Then t, obtained from by re- 
placing every run of length 1 by run of length 3, and every run of length 2 by 
run of length 6, completely traverses s starting at v. 

Note, if s' is obtained from a non-monochromatic graph sequence s by 1-run 
breaking, and S0 is obtained from s' by parity contraction, then the length of 
S0 is at most the length of s. 

Let be a (l,2)-sequence which extra strongly traverses any graph (1,2)- 
sequence of length |s| starting at any vertex. W.l.o.g. the first vertex in t^ has 
color 0. Let t' be obtained from ^0 by the construction from Lemma 2, and let 
t be obtained from t' by prepending a loader ll®L Then t strongly traverses s 
starting at any vertex. Moreover, t is a strong UTS for cycles of length |s|. 

4.2 01-Run Breaking 

This stage is aimed at breaking down long 01-runs. The method used here for 
breaking long 01-runs shares the same spirit with the previous method of break- 
ing 0-runs and 1-runs. Let us describe this stage. 

A (5,10)-01-se(;Mence is a (l,2)-sequence in which every two consecutive pairs 
are separated by a 01-run and every 01-run is of length five or ten. 

Let s be a graph (l,2)-sequence containing at least one pair. We replace 
every 01-run in s of length I G {7,9,11,12,13,14,...} by a stuffed sequence 
r' according to the following table. Let I = 5k d as in Proposition 3. Let 
ro = 01010, ri = 10101, r(, = 01010 100100, and = 10101 011011. 

k even: 

(rori)'=/2(oi)d/2 ^ 

(riro)'=/2(io)rf/2 ^ (r;r(,)'=/2(io)<i/2 

k odd: 

(rori)('=-i)/2ro(10)^/2 ^ (r(,r'i)('=-i)/2r(,(10)^/2 

(^iro)U-i)/2ri(01)^/2 ^ (r}r(,)('=-i)/2r}(01)^/2 
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We can establish the following lemma which is an equivalent to Lemma 1 . The 
proof of Lemma 3 looks at 01-runs in s and its corresponding stuffed sequences 
in s', and shows that (5,10)-01-sequences behave the same on them with respect 
to vertices v,vi,Vr and v' ,v[,v[ (Fig. 2.) 

Lemma 3 (~1). Let s' he obtained from a graph (1,2) -sequence s containing at 
least one pair by 01-run breaking. Let t he a traversal (5,10)-01-sequence ending 
with a 01 -run of length ten. Let v be a left vertex of some pair in s if that pair 
has the same color as the first vertex of t, or let v be a right vertex of that pair 
otherwise. Let v' be a vertex corresponding to v in s' . Lft extra strongly traverses 
s' starting at v' then t extra strongly traverses s starting at v. 



... 100 10101 01010 10101 01010 10101 0 ... 01 01 01 01 01 01 001 . . . 

t f t 



V V r V 

1 r 



. . . 100 10101 011011 01010 100100 10101 on ... on oi oi oi oi oi ooi . . . 

t t ■■■■■ ■■■■■ , ■■■ ■■■ t 

V' V' ^ ‘1 V 

1 r 



Fig. 2. 

Further, similarly to Proposition 4, any (5,10)-sequence beginning with 01- 
run and ending with a pair started at the leftmost vertices or rightmost vertices 
behaves the same on sequences 10, 10 10, 10 10 10, 10 10 10 10, and 10 10 10 10 10, 
where the output vertices are vertices that are neighbors of the surrounding pairs 
in graph sequences containing these sequences (as in Fig. 2.) The same holds for 
negated sequences, and for sequences 1, 101 and 10101, and their negations. 

We conclude this section with the following lemma. 

Lemma 4. Let s he a (l,2)-sequence containing at least one pair with every 01- 
run having length in {1, 2, 3,4, 5, 6, 8, 10}, and let Sc be the pair contraction of s. 
Let tc be a strong traversal sequence of Sc starting at the vertex Vc corresponding 
to some pair in s. Let v be the left vertex of that pair if the color of that pair is 
the same as the color of the first vertex in tc, or let v be the right vertex of that 
pair otherwise. Let t be obtained from tc using the following rules: 

vertex replace by 

0 followed by 0 00 10101 

0 followed by 1 (or nothing) 00 1010101010 

1 followed by 1 11 01010 

1 followed by 0 (or nothing) 11 0101010101 

Then t extra strongly traverses s starting at v. 

Note, if s' is obtained from a (l,2)-sequence s containing at least one pair by 
01-run breaking, and Sc is obtained from s' by a pair contraction, then |s| > 2|sc|. 

Let tc be a UTS for cycles of length |"|s|/2] . W.l.o.g. the first vertex in tc has 
color 1. Let t' be obtained from tc by the construction from Lemma 4, and let 
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t be obtained from t' by prepending a loader 0(10)Lbl/2J , Then t extra strongly 
traverses s starting at any vertex. Moreover, t extra strongly traverses any graph 
(l,2)-sequence of length |s|. 

We choose here to stuff 01-runs every 5 singletons. Similarly to the 1-run 
breaking stage, we could choose to stuff 01-runs every 2f -|- 1 singletons, for 
i> I, and use for traversal a (2i -|- l,4i -|- 2)-01-sequence. 



5 The Construction 

Lemmas 4, 3, 2, and 1 combine to give a construction that converts a strong 
UTS for cycles of length m into a strong UTS for cycles of length 2m. This 
is presented explicitly in step 3 of the algorithm below. Here is the algorithm 
constructing a UTS for cycles of length n. Let k = |"log 2 n] . 

The Algorithm: 

1. Let to be a strong UTS for cycles of length 6, that starts with color 1. 

2. Apply the construction of Proposition 2 to to to obtain a sequence t'^, i.e., 
replace every 1 with 011(01)® and every 0 with 001(01)®, and attach in front 
of t'l sequence 0(10)® to get ti. 

3. For i = l,...,fc — 1, construct a sequence t^_^_i from ti by applying the 
following rules: 

replace by 

a 00-pair [00 10101]® 00(10)® 
singleton 0 [00 lOlOl]^ 00(10)® 
all-pair [11 01010]® 11(01)® 
singleton 1 [1101010]^ 11(01)® 

Prepend loader 0(10)®'"^* [11 01010]®'^Tl(01)® in front of to get a se- 
quence U+i. (This step is a composition of the constructions from Lemmas 
2 and 4.) 

4. Apply the construction of Lemma 2 to tk to obtain a sequence t', and attach 
sequence 1” in front of t' to get a sequence t. 

Sequence t produced by this algorithm is a universal traversal sequence for 
cycles of length n. By analyzing the algorithm it is easy to see that we can 
associate a derivation tree of logarithmic depth with the produced sequence. 
Every node in that tree corresponds either to a pair or a singleton, and certain 
nodes correspond to loaders. Hence, all the nodes, except the nodes correspond- 
ing to loaders, are of bounded degree. It is straightforward to traverse that tree 
in depth-first manner and to produce sequence t using only logarithmic space. 
There is also a uniform NC^ circuit that, given 1", constructs that sequence. 

The length of the produced sequence is proportional to the number of nodes 
at the last level of the derivation tree. The average number of descendants of a 
node corresponding to a pair or a singleton is at most 25.635. Thus, the number 
of nodes at f-th level of the tree satisfies the recurrence: f{i -I- 1) < 25.635/(t) -I- 
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48 • 4* + 12. We set k = [log 2 n] in the algorithm but it is actually enough to set 
k = [log.^ n] , because it can be shown that any graph sequence shrinks to the 
one-fifth of its original length during repeating the sequence of operations (1-run 
breaking, parity contraction, 01-run breaking, pair contraction) twice. Thus the 
length of the produced sequence is 

There may still be some room for slight improvement. If long 01-runs would 
be stuffed every seven singletons instead of five during the 01-run breaking and 
(7,14)-01-sequences would be used for traversal, the length of the produced se- 
quence would possibly be 
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Abstract. A language is universally polynomial if its intersection with 
every NP-complete language is in P. Such a language would provide an 
automatic method for generating easy instances of intractable problems. 

In this note, we give a complete characterization of universally poly- 
nomial languages that are context-free, answering an open question in 

[4]. 

1 Introduction 

It has been observed that most NP-complete languages have an infinite subset in 
P, i.e. most of these difficult languages have a nontrivial easy subset [2]. Whether 
this property holds in general is an open question with important consequences: a 
negative answer would refute the Isomorphism Conjecture for NP-complete sets, 
whereas a positive answer would imply that public-key cryptosystems based on 
NP-complete languages are inherently insecure. 

Interestingly, it has also been observed that easy subsets for many NP- 
complete languages can be obtained by applying a common restriction to the 
problem domain. For example, 3-COLORING, CLIQUE, INDEPENDENT SETS, 
and PARTITION INTO CLIQUES are NP-complete problems that become poly- 
nomially solvable when the domain is restricted to chordal graphs [5]. This ob- 
servation has led Demaine, Lopez-Ortiz, and Munro to ask whether there exists 
a universal restriction that would always yield an easy subset when applied to 
any NP-complete problem [4] ; the existence of such sets would provide an auto- 
matic method for generating easy instances of intractable problems. Specifically, 
they introduced the concept of universally polynomial languages as those whose 
intersection with any NP-complete language is in P, and the weaker concept 
of universally simplifying languages as those whose intersection with any NP- 
complete language is not NP-complete. These two types of languages together 
are called universally easy languages. It turns out that C, the set of chordal 
graphs, is not a universally simplifying language, since the graph-theoretic prob- 
lem of A-COLORING remains NP-complete for chordal graphs [3]. Assuming 
that P yf NP, it follows that C is not universally polynomial. 

* This research is partially supported by an SCU-IBM Faculty Research Grant. 
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It is easy to see that all finite languages are universally polynomial. In prac- 
tice, we are interested in nontrivial (i.e. infinite) universally polynomial lan- 
guages that are efficiently computable. Note also that these concepts are inter- 
esting only if P NP; otherwise every languages in NP would be universally 
polynomial and no language in NP would be universally simplifying. At present, 
a complete characterization of universally easy languages in P is not known. 
[4] took a first step towards this goal by characterizing universally easy sets 
that belong to a natural subclass of P, namely the class of regular languages. 
Specifically, they showed that 

1. all finite (regular) languages are universally polynomial; 

2. all sparse (regular) languages are either universally simplifying or polyno- 
mial; 

3. no nonsparse regular language is universally polynomial if P yf NP. 

In this paper, we give a complete characterization of universally polynomial 
languages that are context-free, answering a problem left open in [4]. We show 
that the set of universally polynomial context-free languages consists of 

1. all context-free languages if P = NP; 

2. all sparse infinite context-free languages if P yf NP but DEXT = NEXT; 

3. all finite languages if DEXT yf NEXT. 

We also obtain results for non-context-free languages. Specifically, we show 
that 

1. no co-sparse language in P is universally simplifying; 

2. no co-sparse language in NP is universally polynomial if P yf NP. 

The rest of this paper is organized into four sections. Relevant definitions 
and previously known results are stated in Section 2. The complete character- 
ization of universally polynomial context-free languages appears Section 3 and 
the results on co-sparse universally easy languages appear in Section 4. Finally, 
some open questions are given in Section 5. 

2 Preliminaries 

All languages in this paper are over the binary alphabet S = {0, 1}. We use the 
terms language and set interchangeably. We assume that the reader is familiar 
with language classes in the Chomsky hierarchy (e.g. regular and context-free 
languages) and complexity classes (e.g. P, NP, DEXT = (J^ DTIME(2°”), and 
NEXT = NTIME(2‘=”)). It is known that REGULAR and CONTEXT-FREE 

are subclasses of P. See [6] for more details about these classes. 

A is a universally polynomial language if for every NP-complete language C, 
Ar\C G P. Note that every finite languages is universally polynomial, and every 
language in NP is universally polynomial if P = NP. 

i? is a universally simplifying language if for every NP-complete language C, 
BnC is not NP-complete. Note that no language in NP is universally simplifying 
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if P = NP. A language is universally easy if it is either universally simplifying 
or universally polynomial. 

L is a sparse language if the number of strings in L of length at most n is 
bounded above by a polynomial p{n). L is a tally language if L C 0*. Note that 
all finite and tally languages are sparse. The existence of tally languages in NP 
— P has been characterized by the following theorem (due to Hartmanis) [1]: 

Theorem 1. DEXT yf NEXT iff there is a tally set in NP — P iff there is a 
sparse set in NP — P. 

T is a hounded language if there are some binary strings w\, W 2 , ■ ■ ■, w„ 
such that L Q wlw 2 ■ ■ ■ w^. The following characterization of sparse context-free 
languages is well-known: 

Theorem 2 ([8,7]). A context-free language is sparse iff it is hounded. 

3 Characterization of Universally Polynomial 
Context-Free Languages 

We begin by showing that E* N for every nonsparse context-free language 
N , and hence none of them can be universally simplifying. 

Theorem 3. No nonsparse context-free language is universally simplifying. 

Proof. Let L be a nonsparse context-free language over the binary alphabet 
E = {0,1}, and Gl he a, Chomsky normal form context-free grammar for L 
without useless or nonreachable variables. Since L is nonsparse, it is infinite. By 
the Pumping Lemma for context-free languages, there exists some variable T 
that can make a derivation of the form: 

T AoTTo, 

where Xq, Yq are binary strings, and |Ao| > 0 or |Fo| > 0- Suppose in addition 
that the same variable T can make another derivation of the form: 

T^* XiTYi, 

where Xi and Yi are binary strings, and or 

Because contains no useless or nonreachable variables, by iterating the 
above two derivations, we can generate strings in L of the form 



uxiX2...Xkvyk...y2yiw, 



for every fc > 1, where u, v, and w are some binary strings, and each pair (xi, yi) 
is either ((Ao)l^il, (Yo)'^'')> or (Fi)l^«l). 

Define a polynomial-time, 1-1, invertible function / from {0, 1}* to L as 
follows: given a binary string z = h\b 2 ... hn of length n. 



f{z = hib 2 ...bn) = uxb^Xb^ . ..Xh„vyb„ . ..yb^yt^w. 
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Since / is 1-1 and polynomially invertible, the image f{C) of any NP- 
complete set C under / is also NP-complete. This shows that L has an NP- 
complete subset and therefore is not universally simplifying. 

So it remains to show that for nonsparse context-free languages L, the addi- 
tional derivation T XiTYi exists, where We will show 

the contrapositive: if no such additional derivation exists, then L must be a 
bounded language and therefore must be sparse [8]. 

For each variable T of Gl, consider all derivations of the form T =!>’*' XkTYk, 
where Xk and Yfe are binary strings (if they exist) . If for any pair (z, j), (Xj) I = 
and then X, and Xy can be written as Xi = (Xt)®* 

and Xj = {XtYY where [Xt^I = gcd{\Xi\,\Xj\). Similarly, and Yj can be 
written as powers of some Yr of length gcd{\Yi\, \Yj\). Thus any derivation of 
T that begins with T XkTYk will eventually generate a binary string of the 
form (Xt)* Zt{Yt)* ■ Here Zt is a binary string, whose derivation tree contains 
no occurrences of T except at the root. 

Now consider a derivation tree for any word in L. For any long enough 
branch so that it has two occurrences of some variable T, collapse the segment 
starting with the first and ending with the last occurrence of T and replace 
it with a node labeled by {Xt)*T{Yt)* ■ Repeat this pruning process until no 
branch of the tree contains two or more occurrences of some variable. Each leaf 
of the pruned tree is associated with a nested regular expression of the form 
(XtY*IXt 2)* ■■■{XTm)*Z{YT^)* (which appear on the path 

from the root), and the variables Tz are all different. The number of such pruned 
trees as well as the number of labels that can be assigned to each leaf are con- 
stants depending only on L. Hence every word in L has the form wlw 2 ■ ■ -wY 
where k is some constant depending only on L. In other words, L is a bounded 
context-free language and hence it is sparse. □ 

In view of Theorem 3, it is unlikely that any nonsparse context-free languages 
can be universally polynomial. In fact, either all or none of them are. 

Theorem 4. The following are equivalent: 

1. P= NP; 

2. every nonsparse context-free language is universally polynomial; 

3. some nonsparse context-free language is universally polynomial. 

Proof. (1) (2): if P = NP, then the intersection of any context-free language 

with an NP-complete language is a language in NP and hence must be in P; 

(2) (3): obvious; 

(3) (1): suppose L is a nonsparse context-free language that is universally 
polynomial. The proof of Theorem 3 shows that L has a subset C which is NP- 
complete. Hence LDC = C is both NP-complete and in P, so P = NP. □ 

We now turn our attention to sparse context-free languages. Using the same 
technique, we show that 0* L for every infinite sparse context-free language 
L, and hence none of them are likely to be universally polynomial. In fact, either 
none or all of them are. Theorem 1 is crucial in the proof of this characterization. 
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Theorem 5. The following are equivalent: 

1. DEXT= NEXT; 

2. every infinite sparse context-free language is universally polynomial; 

3. some infinite sparse context-free language is universally polynomial. 

Proof. (1) (2): iff DEXT = NEXT, then the intersection of any infinite sparse 

context-free language with an NP-complete set is a sparse set in NP and therefore 
must be in P; 

(2) (3): obvious; 

(3) (1): let L be an infinite sparse context-free language. It follows from 
the Pumping Lemma for context-free languages that there exist strings x, y, u, 
v, w such that |i/| -I- |f | > 0 and L' = {xy^uv^w : t > 0} is an infinite subset of 
L. The map /(O*) = xy’’uv^w is a polynomial-time isomorphism between 0* and 
L' such that for any tally language T, T is in P iff /(T) is also in P. 

Given a tally language T in NP, we construct an NP-complete language Ct 
such that CtP^L = f{T) as follows. Define S = {wx : |w| = |cc|Aa; G SATAwx ^ 
L}. S is clearly in NP. Furthermore, SAT S via the following reduction: 
on input x, enumerate strings w of length |a;| until wx ^ L is satisfied, and 
output wx. Since L is sparse, such a string can be found in polynomial time. 
Furthermore, x G SAT iff wx G S. Thus S is NP-complete. Note also that 
S' n L = 0. Now define Ct = S U /(T). Clearly Ct is also NP-complete and 
Crn/(T) = /(T). 

Since L is universally polynomial, /(T) is in P for all tally set T in NP. Hence 
all tally sets in NP are in P, so DEXT = NEXT. □ 

Combining Theorems 4 and 5 we can now describe precisely the class of uni- 
versally polynomial context-free languages provided we know the relationships 
between P vs. NP and DEXT vs. NEXT. Note that DEXT =/= NEXT implies 
that P yf NP. 

Corollary 1. The class of universally polynomial context-free languages con- 
sists of 

1. all context-free languages if P = NP; 

2. sparse context-free languages if P ^ NP but DEXT = NEXT; 

3. finite languages if DEXT ^ NEXT. 



4 Universally Easy Co-sparse Languages 

We extend the results obtained in Section 3 for non-context-free languages whose 
complements are sparse. Again, it is unlikely that any of the co-sparse languages 
in NP is universally polynomial, but if one is, then all of them are. 

Theorem 6. No co-sparse language in P is universally simplifying. 
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Proof. Let L € P he a, language whose complement is sparse. Define a polynomial- 
time 1-1 function f : E* ^ L as follows: on input w, f searches for the lexico- 
graphically first string v of length |w|, such that vw G L and outputs vw. Such 
a string exists and can be found in polynomial time since L is in P and L is 
sparse. Clearly / is one-one. 

For any NP-complete language C, its image f{C) C L is also in NP, since / 
is polynomially honest, and therefore NP-complete. Hence L is not universally 
simplifying. □ 

Theorem 7. If P ^ NP, then no co-sparse language is universally polynomial. 

Proof. Suppose P yf NP. Let C be an NP-complete language and define C = 
{xw : |a;| = |w| Aw G C}. C is also NP-complete. If there is a co-sparse language 
X such that X D C G P, then we show that C G NP, a contradiction. 

Let |^~”| be bounded by some polynomial uf . The following is a polynomial- 
time algorithm for C\ on input w, enumerate the first |2w|^ -I- 1 < strings 
of length jwl, and for each such string x, check whether xw G X DC'. It is clear 
that w G C iff one such test succeeds. □ 

Corollary 2. The following are equivalent: 

1. P = NP; 

2. every co-sparse language in NP is universally polynomial; 

3. some co-sparse language in NP is universally polynomial. 

5 Conclusions 

We have characterized the class of universally polynomial context-free languages. 
Under the assumption that i) P yf NP but ii) DEXT = NEXT, an infinite uni- 
versally polynomial language in P exists. It is not known whether the second 
assumption is necessary for the existence of such sets. It would also be inter- 
esting to investigate the relationship between universally easy languages and 
NP-complete languages without an infinite P-subset. 
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Abstract. This paper gives an exponential separation on the depth of 
branching programs (BPs) between oblivions and non-oblivions BPs. 
Namely, there is a difference just like the difference between sequen- 
tial and NC computation: (i) There is a Boolean function /i of N vari- 
ables which can be computed by a polynomial-size, syntactic BP with a 
depth of 2 log N — log log N + 1 but cannot be computed by any oblivi- 
ous BPs with a depth of (2 — e)N for some e G o(l). (ii) Similarly, there 
is an /2 computed by a syntactic depth of log® N but not by an obliv- 
ious depth of n{N log N). (hi) We also show that any (unrestricted) 
BP of depth t can be simulated by an oblivious BP with a depth of 
N + \{t — log A)/(loglog Y -b C)] ■ N. The third result implies that fi 
cannot be computed by any BP with a depth less than log N + log log N 
and /2 not with a depth of o(log Y • log log Y). Note that most bounds 
in this paper include constant factors and lower-degree terms. 



1 Introduction 

One of the primary goals of theoretical computer science is to prove non-linear 
lower bounds for the size of Boolean circuits. However, in spite of serious efforts 
for several decades, we are still at a very elementary stage and are struggling 
for good research hints. Probably for this reason, several researches have been 
shifting their attention from Boolean circuits to Boolean branching programs 
(BPs): (i) BPs are also a general computation-model whose size and depth are 
naturally related to space and time [8] . (ii) It is relatively easy to consider reason- 
able restrictions which provide interesting subclasses, such as oblivious models 
discussed in this paper, for more attainable lower-bound arguments. 

Not surprisingly, therefore, we have a large literature. First of all, Ajtai [2] 
recently made a big breakthrough by showing a nonlinear depth lower bound 
for unrestricted BPs (no nontrivial lower bounds for even read-twice BPs were 
known before.). In [13], Thathachar gave an exponential separation on the size 
between k-sBPs (read-fc syntactic BPs) and {k + l)-sBPs for all k, extending 
the previous result [7] that separates only between k = 1 and k = 2. Then, 
Beame, Saks and Thathachar [10] obtained the first (again exponential on the 
size) separation result between semantic and syntactic BPs, i.e., between /c-BPs 
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and fc-sBPs, by giving a Boolean function fk which can be computed by a 2-BP 
of polynomial size but cannot be computed by any fc-sBP of polynomial size. For 
oBPs (oblivious BPs) and OBDDs, similar separation results have been known 
for longer time, including those between fc-oBPs and {k+ l)-oBPs [13], between 
fc-OBDDs and (/c-|-l)-OBDDs [9,3,1] and between 2-oBPs and [J^ /c-OBDDs [11]. 
(The separation between fc-oBPs and {k + l)-oBPs is mentioned only implicitly 
in [13].) 

In this paper, we focus our attention not on the size but on the depth of 
BPs which is an equivalently popular measure (e.g., [4,6]). Interestingly, our 
results have exactly the same flavor as typical results on parallel vs. sequential 
computation. Remember what we have usually done in the research of parallel 
computation: (i) For a given problem (or a Boolean function) Q, we design an 
efficient (poly-log depth) parallel algorithm A and then (ii) claim its efficiency by 
comparing the time complexity of the standard sequential simulation of A and 
the time complexity of the best known sequential algorithm for Q. This paper 
just follows this approach: (i) There is a Boolean function fi of N variables 
which can be computed by a syntactic BP of depth 2 log N — log log iV -|- 1 but 
cannot be computed by any oblivious BPs of depth {2 — e)N for some e G o(l). 
Also, (ii) there is an /2 which is computed by a syntactic depth of log^ N but not 
by an oblivious depth of f2{N log N). Thus, a poly- logarithmic depth is enough 
for some functions which need a polynomial (i.e., exponentially larger) oblivious 
depth. Our third result is on the simulation; (iii) any BP of depth t can be 
simulated by an oblivious BP with a depth of iV-|- [ (t— log N ) /(log log fV-kC')] -N. 
This simulation result implies that /i cannot be computed by any BP with a 
depth of less than log fV-klog log N and /2 not with a depth of o(log N ■ log log N) . 
Thus those lower bounds are somewhat close to the upper bounds in (i) and (ii) 
above, but are not tight. It should be noted that our results in this paper give 
specific constant factors and even lower-degree terms. 

As for lower-bounds on the depth of oblivious BPs, three Boolean functions 
have appeared in the literature: two have an f2{N log N) lower bound [4] and the 
third one has an fi{N lo^ N) lower bound [6]. Our second function /2 is a mod- 
ification of one of the two functions in [4]. Similar modification is also possible 
to the function in [6] and we can design an efficient syntactic BP. Unfortunately, 
we can only get an 0{N‘^) upper bound for this syntactic depth at this moment 
(see Section 5). 

It is hard to cover all the related research in this detailed abstract. Among 
others, many authors have discussed BPs which can read most variables only 
once but can read the remaining few variables many times [e.g., [12]]. OBDDs 
have a long history of research which includes a lot of papers discussing their 
practical merits for computer-aided design and hardware verification [e.g., [5]]. 

In the rest of this paper, we first give definitions and notations in the next 
section. Then we give the simulation result in Section 3, which will help under- 
standing basic differences between oblivious and non-oblivious BPs. Sections 4 is 
for the two separation results and finally we briefly mention some related results 
in Section 5. 
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2 Definitions 

Fig. 1 illustrates a simple example of a BP. A BP is an acyclic directed graph 
with a single source vertex of zero in-degree. Each sink vertex of zero out- 
degree is labeled either 0 or 1. Each non-sink vertex is labeled some variable 
Xi among xi,X 2 ,--- ,xm and has two outgoing edges labeled 0 and 1. Once a 
0/ 1-assignment to the variables x\ through a ; at is given, we can determine a 
unique computation path from the source vertex to some sink vertex labeled 0 
or 1. For example, if (xi,X 2 ,X 3 ) = (1,0, 1), the unique path ends with the sink 
vertex labeled 1 in Fig. 1. Thus a BP computes a Boolean function of N vari- 
ables. The size of a BP is the number of non-sink vertices and the depth of a 
BP is the number of non-sink vertices on the longest computation path. In this 
paper, all the BPs have a size bounded by some polynomial in N , so we often 
omit mentioning it. 




Fig. 1. example of a BP 



As shown in the example in Fig. 1, each computation path can read the 
variables in a different order. If all the computation paths read the variables in 
the same order, such a BP is called an oblivious BP. ( Unrestricted or semantic) 
BPs and syntactic BPs have no such restriction on the order of reading variables. 
Instead there is an important difference between them in the way of measuring 
the depth. See Fig. 1 again: If one looks at this BP carefully, it turns out that 
the path X 1 X 2 X 1 X 3 X 1 is not actually followed by the BP since it would mean 
that the value of xi is 0 when it is read for the first time and is 1 for the second 
time. Such a computation path is said to be not realizable. Syntactic BPs take 
into account all computation paths whether or not they are realizable and its 
depth is the length of the longest path among them. Semantic BPs (or simply 
BPs) take into account only realizable computation paths and its depth is the 
length of the longest, realizable path. 
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A BP is said to be tree-like if each vertex (other than the source) has only 
one in-degree. Any BP can be changed to a tree-like BP simply by splitting a 
vertex of in-degree k into k vertices of in-degree one from top to bottom. Note 
that the depth does not change but the size can enlarge greatly. Obviously, any 
tree-like BP has an equivalent tree-like BP such that no variable appears twice 
or more in every path. 



3 Simulation by Oblivious BPs 



Recall that the relationship between BPs and oblivious BPs is somewhat similar 
to the relationship between parallel and sequential computation. 

Theorem 1. Suppose that a Boolean function f of N variables is computed 
by a BP of depth t. Then / can be computed by an oblivious BP of depth 
N+\{t- log N ) /(log log Af -b C)1 • N. 

Proof. For a given BP Q of depth t we construct an oblivious BP P. The 
construction is basically to divide Q into blocks of depth log log N and simulate 
each of them by oblivious BPs of depth N . However, a special treatment can be 
done for the top logfV-depth portion of Q (see Fig. 2). 



non-oblivious BP oblivious BP 




Fig. 2. Simulation by Oblivious BPs 
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Lemma 1. Any BP of depth log N can be simulated by an oblivious BP of 
depth N. 

Proof. We first change the given BP, say, Mi, into a tree- like BP, M 2 , which 
includes N—1 vertices or at most N—1 variables. Suppose that M 2 looks like Fig. 
3(a). Then an equivalent oblivious BP can be constructed simply by “inserting 
missing vertices” in each path of M 2 as shown in Fig. 3(b). For example, we 
insert V 2 ,Vii, W 5 and vj into the path of viv^vq of M 2 - Each inserted vertex has 
only one out-degree and therefore the size dose not blow up. □ 

It should be noted that this construction is not always possible if the variable 
order in the target oblivious-BP is restricted. For example, if the variable order 
of the oblivious BP is the opposite of the above width-first order, then the size 
turns out to be exponential. This is the reason why we cannot divide Q into all 
logfV-depth blocks. 

Lemma 2. Let C > 0 be any constant and S = viV 2 - ■ -vn he any variable 
order. The any BP of depth log log IV -|- C whose variable set is a subset of 
{vi,V 2 , • • • , vn} can be simulated by an oblivious BP of depth N whose variable 
order is S. 



non-oblivious BP oblivious BP 




Fig. 3. Basic idea of simulation 



Proof. Since the given BP, say. Mi , is of depth log log N + C, the number 
of vertices is O(logiV), namely Mi looks at only O(logfV) variables. Hence we 
can immediately make an oblivious BP, M 2 , whose shape is a completely binary 
tree of size and whose variable order dose not contradict to S. Finally 

we can just insert missing variables as before. □ 

Now we divide the original BP Q into blocks as shown in Fig. 2. The first level 
includes vertices from the first level until the (log iV-|- l)th level, the second block 
includes vertices from the (log fV-|-l)th level until the (log A^-|-log log fV-|-C'-|-l)th 
level. (Namely, vertices at the (log IV -|- l)th level are shared by the first and 
the second blocks.) In general the /cth (fc > 3) block starts at (logN -|- (fc — 
1) (log log -I- C) -I- l)th level and ends at (log N + k{log log -I- C) -I- l)th level. 
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Thus Q is divided into one block of depth log N (vertices at the lowest level at 
each block are viewed as sink nodes and are excluded in counting the depth), 
and [(t — log A^)/ (log log + C)] blocks of depth log log A^ + C, excepting the 
last block which may be shorter. 

Each block can be viewed as a set of (a polynomial number of) BPs. Each 
of them is transformed into a tree-like BP but we do not change vertices at the 
lowest level, i.e., they can accept an arbitrary number of incoming edges. Now 
these BPs are transformed into oblivious BPs as follows: 

(1) The first block is transformed into an oblivious BP using the construction 
in Lemma 1. 

(2) For each BP in other blocks, we use the construction in Lemma 2, where 
the variable sequence S for all these BPs must be the same but may be any 
(fixed) one. 

Note that we did not change the vertices which are shared by neighboring two 
blocks (whose labels may have been changed) and therefore those transformed 
BPs are naturally combined into a single BP which is equivalent to Q. One can 
see that this BP is oblivious and its depth is N-l- [(t — log A^)/(loglog A^-kC)] - N. 

□ 



4 Separation Results 

In this section log N and log log N always denote [log A^] and [log log A^] , respec- 
tively. We introduce two Boolean functions which give an exponential separation 
on the depth between oblivious and non-oblivious BPs. 

Theorem 2. Let s be any function such that s > 2 for infinitely 

many integers A^ > 0 (e.g., e may be — G o(l)). Then there exists a 

Boolean function, fi, of N variables such that fi can be computed by a syntactic 
BP of depth 2 log N — log log A^ -k 1 but cannot be computed by any oblivious 
BP of polynomial size whose depth is (2 — e)N. 

Proof. A Boolean function /i(a;i, X 2 , • • • , xat) is defined as follows: Its 
variables xi,X 2 , ■ ■ ■ , xn are partitioned into blocks 0o,0i, • • • , dr v t , . The 

I log N I 

first block f3o consists of the first logN — log log N variables, i.e., xi through 
a^iogW-iogiogAf) and each of subsequent /3j’s {i > 1) consists of subsequent logN 
variables (the last one may be shorter again). To decide the value of /i: (i) We 
first look at block /3 q and regard its value as a binary number bo. (ii) Then we go 
to block Pbo+i and again regard its value as a binary number bi. (iii) The value 
of /i is equal to the value (0 or 1) of . 

Upper Bound. It is not hard to see that /i can be computed by a tree-like BP 
Q of depth 2 log N — log log N + 1. Its size is obviously bounded by a polynomial 
because its depth is logarithmic. Q first reads variables in Po which includes 
log N — log log N variables and due to its value Q decides which Pi of log N 
variables should then be read. Finally Q reads Xb^ and goes to a sink vertex of 
0 or 1. 

Lower Bound. Suppose that there is an oblivious BP, M, of depth (2 — e)N 
whose variable sequence is S. For simplicity we assume that the length of S is 




34 



Kazuo Iwama, Yasuo Okabe, and Toshiro Takase 



even and let Si and S 2 be the first and the second halves of S. The following 
simple lemma is important : 

Lemma 3. There are at least variables which appear only in and also 
at least other |7V variables which appear only in 82 - 

Proof. Since |S'i| = (1 — |)fV, there are at least |fV variables which do not 
appear in Si . Those variables must appear in S 2 since fi clearly depends on all 
the variables. Similarly for 82 - n 

Let us call variables only appearing in Si A-variables and variables only in 
S 2 Z-variables. Recall that |/3o| = log — log log and there are at least |A^ 

Z- variables. Therefore, some block, f3i{i > 1), contains at least 

lA^-(logA^-loglogA^) ^ 

TkigWl-i 2 s N 

Z-variables. Although details are omitted, we can show that the second term 
(Q(iog^)) jg small and does not cause any problem if we omit it. In the fol- 
lowing we do not write it just for simplicity. Select arbitrary | log N Z-variables 
in f3i and let BZi be the set of those variables. Also let BZi be the set of the 
remaining (1 — |)logA^ variables in Pi. Recall that the value assigned to Pi~ 
variables is a pointer to some variable xpl < i < N). Hence we can define the 
following equivalence classes on the set of the whole N variables: 

Ej = {xi\ the pointer to Xi (= the value i) 

has the same 0/1 assignments (= j) to BZi variables } 

Note that the number of these equivalence classes is 
and therefore at least one Ei contains —n — rr = I • A^s A-variables or more. 

Now recall that all those A-variables appear in Si which is the first half of 
the variable sequence, and any of them, Xk, can be selected by assigning some 
proper values to BZi variables. (Reason: We first Fix the Pq variables to show 
the Pi block. Then fix BZi variables so that it will show the Ej equivalence class 
which includes Xk- Now we finally fix BZi variables to show Xk-) Since those 
A-variables appear in Si and all the BZi variables appear in S 2 , the BP M 
has to remember all the values of the A-variables at the end of ^i. This needs 
vertices there, which is not bounded by any polynomial if £ satisfies the 
condition of the lemma. □ 

Theorem 3. There exists a Boolean function, / 2 , of N variables such that 
/2 can be computed by a syntactic, read-once BP of depth log^ N but needs an 
Q{N\ogN) depth to be computed by oblivious BPs. 

Proof. Before giving the definition of / 2 , we shall review the lower-bound 
result in [4]. 

Lemma 4 [4]. There is a Boolean function, f^, of N variables for which any 
oblivious BP need a depth of I2(A^log A^). 

Boolean function looks like this: The variables are xi, - ■ ■ ,Xn,yi, - ■ ■ ,yn 
and each variable takes value 0,1 or 2 i.e., they are 3-way variables. Actually, we 
use two Boolean variables for each Xi and j/i, and hence N = 4n. It is defined that 
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/I = 1 iff the 0/ 1-sequence of x\, - ■ ■ ,Xn obtained by neglecting the variables 
having value 2 is equal to the 0/ 1-sequence of yi, • • • , y„. The basic idea of their 
proof is as follows: Suppose that an oblivious BP, M, computes /| whose variable 
sequence is S and that most variables are read at least k times in S. Then it 
turns out, by a Ramsey-like theorem, that we can select Xj-variables and 

the same number of ?/i-variables for some constant c > 0 such that S is divided 
into ck blocks each of which contains only selected Xi-variables or only selected 
j/i-variables. Since all the other variables can be killed by assigning 2, a simple 
counting argument gives us 

p^cfe-l > 

where W is the maximum width at each boundary of blocks. For W to be 
bounded by a polynomial, we need that k = 12(logn), which implies the lower 
bound of the lemma. 

This argument is perfectly good for their result but a careful observation 
gives us the following improvement: 

Observation. We only need less survived variables than ones to 

obtain a superpolynomial bound. Namely log^ n ones are enough since in order 
to satisfy 

^rck-l ^ 2*°®^ " 

W needs to go beyond a polynomial if fc = o(logn). 

This observation seldom improves their result (only a constant factor), but 
it is very important in this paper. We again use a similar addressing scheme as 
before to show these surviving variables. 

Definition of the Function. /2 (-^1,-^2) is defined as follows: Both Xi and 
X 2 consist of n log N' + N' variables like 

— Qi.l^ Qi,n: ^i^N' — I5 2), 

where n = log^ N' and each Qij includes log N' variables. Note that the whole 
number N of variables is N = 2(nlog W -I- N'). When 0/1-values are assigned 
to the variables, let Sij (i = 1, 2) be the sum of the values (the binary numbers 
as before) for Qip, • • • , Qij- Then, /2 = 1 if and only if (i) the value for Qij > 0 
for all i and j, (ii) Si^n < for both i = 1 and 2, and (iii) ^ = a^2,s2 j for foi 
1 < j < ri. Namely the 0/1-sequence of xij’s which are selected by Qip, ..., Qi,n 
must be the same as the 0/1-sequence of similarly selected X 2 j’s. 

Lower Bound. We omit the lower-bound proof since it is very similar to 
Lemma 4. 

Upper Bound. Since the previous BP given in the proof of Theorem 2 had 
a logarithmic depth, it was able to be constructed as a tree-like BP. This time, 
however, tree-like BPs obviously blow up and we need a careful description for 
DAG-like BPs. To this end, we introduce what we call a “RAM-model” M for 
BPs: (i) M has a level-counter i and an internal state S of O(logA^) bits, (ii) 
Its one-step move is determined by the values of i and S, which includes reading 
some variable, and changing the values of i and S due to the current values of i 
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and S and the value of the variable just read, (iii) M may read more than one 
variable in its one-step move (multiple read). 

It is not hard to see that a program written by this rule can be immediately 
transformed to a BP by changing all different states into different vertices. Its 
size is bounded by a polynomial since the internal state has only O(logiV) bits. 
Its depth is the maximum level, but if multiple read is used then we have to 
increase it appropriately. 

Now we give our RAM-model program M which computes /2 . M reads vari- 
ables in the following order: 

Ql^l^ 1 ^2,1? ^2,S2,1 ^ ^1,25 ^l,Si_2 ’ ’ 1 ^2,nj ^2,S2,n 

Here which variable Xi^si j is depends on the value of Qij. Also, all variables 
in Qij are read in one step, which will not cause any problem since \Qij\ = 
logN' < logiV. Internal state of M has the form of S' = (Pi,P2,z), where Pi 
holds the position of variable xij which was read most recently, i.e., that is the 
value of Si j- at that moment, z needs only one bit to hold the value of Xi^si j 
until it is compared with X2,s2,y Thus, the size of S is 21 og A^' -I- 1 or 0 (log A^). 
The level counter i changes as follows: 

(1, 1)q(1, 1),(2, 1)q(2, 1),,(1, 2)q( 1, 2), • • • (2, n)Q(2, n)„ 

where (*, j)g means that it is the time when M is to read Qij and (i,j)x means 
that it is the time when M is to read ^ . Note that its initial state is (0, 0, 0). 

Now we describe a one-step behavior at each level. 

(i) The current level is (t, j)q. Read Qij. Suppose that the current state is 
(Pi,P2,z). Then if Qij = 0 or -|- Qij > N', then go to [0] (= a sync-node 
labeled by 0). Otherwise, change its state to (Pi -I- Qij,P2,z) when i = 1 and 
to (Pi, P2 + Q 2 ,j,z) when i = 2 and go to the next level. 

(ii) The current level is {l,j)x- Read a;i,Pi, change its state to (Pi, P2, xi^pQ, 
and go to the next level. 

(iii) The current level is (2,j)x- Read X2,P2 - If the current z is equal to 3:2, P2 
then if j = n then go to [ 1 ] else just go to the next level. If z yf X2 p^ then go to 
[ 0 ]. 

One can see that this program reads at most 2 n Qi j’s and 2 n Xij’s and 
hence the depth of the equivalent BP is 2 n log A^' -I- 2 n = 0 (log^ A^). (Recall that 
n = log^ N' and N = (n log N' + N').) Also this BP never reads a single variable 
twice, namely, it is a read-once BP. □ 



5 Some Remarks 

Here are a few remarks which might help: 

Remark 1. One might have noticed that there is a trivial function of N 
variables which is computed by a BP of depth log N but needs an N oblivious 
depth. One such example is the function which is computed by the BP that has 
a form of a complete binary tree and all the vertices read different variables. 
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The final value is the value of the variable which is read at the end of each path. 
Clearly this function depends on all the variables and therefore any equivalent 
oblivious BP needs the depth which is equal to the number of variables. Such a 
separation is optimal due to Theorem 1. Our real purpose in this paper is not 
to give such a single example (which is enough only to prove a separation) but 
to give more general evidence that oblivious BPs need exponentially more depth 
than their non-oblivious counterparts for a wide range of Boolean functions. 

Remark 2. By a small modification of /2 of Theorem 3, we can obtain a 
function which needs a non-linear oblivious depth but can be computed by a 
BP of depth log^’*'^ N for any constant e > 0. The idea is that if /e is a constant, 
then the width W which satisfies is not bounded by any 

polynomial. Thus the construction of is exactly the same as /2 except for 
n = log^'*’® N'. 

Remark 3. The best depth lower bound for oblivious BPs is f2{Nlog^ N) 
in [6]. The Boolean function used there also uses 3- way variables, i.e., variables 
having value 2 are all “killed.” Therefore, we can also build our addressing scheme 
into this function. Unfortunately, their lower-bound proof needs a polynomially 
many variables to survive. Thus this construction still needs 0{N^) depth to be 
computed by unrestricted BPs. 

6 Conclusion 

An apparently interesting attempt is to try to find a nontrivial separation be- 
tween oblivious and non-oblivious depth which is optimal. As mentioned before, 
our Theorems 2 and 3 are not optimal with respect to Theorem 1. Theorem 3 is 
a little far from the optimal value but Theorem 2 is much closer. One possibility 
is to reduce the size of Pi {i > 1) from logA^ to, say, (log log A^)*. If we do so, 
then it can no longer make pointers to all the variables. However, one can see 
that it would be enough to have pointers to some subset of the variables which 
contains sufficiently many A-variables. If the numbers of A- and Z-variables are 
relatively large, e.g., some ^ fraction of the total number for a constant C, this 
appears to be quite plausible. 

In this paper we were only able to give a similar type (having the address- 
ing scheme) of Boolean functions for the separation. Seeking different types of 
Boolean functions having the same separation property will be also interesting. 
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Abstract. We prove that a very basic class of program schemes aug- 
mented with access to a queue and an additional numeric universe within 
which counting is permitted accepts exactly the class of recursively solv- 
able problems. The class of problems accepted when access to the nu- 
meric universe is removed is exactly the class of recursively solvable 
problems that are closed under extensions. We build upon NSPQ(l) 
an infinite hierarchy of classes of program schemes and show that the 
class of problems accepted by these program schemes has a zero-one law 
and consists of those problems defined in any vectorized Lindstrom logic 
formed using operators whose corresponding problems are recursively 
solvable and closed under extensions. We apply our results to yield logi- 
cal characterizations of complexity classes and provide logical analogs to 
inequalities and hypotheses from complexity theory involving the classes 
NP through to ELEMENTARY. 

1 Introduction 

Descriptive complexity theory essentially began with Fagin’s Theorem [7] which 
states that a problem can be accepted by a polynomial-time non-deterministic 
Turing machine iff it can be defined in existential second-order logic. Since then, 
the relative relationships of a whole range of complexity classes and logics have 
been examined (see, e.g., [6,10]). Having logical equivalents of hypotheses in 
complexity theory enables attack using tools from finite model theory, such as 
Ehrenfeucht-Frai'sse games and zero-one laws. 

The connection between a complexity class and a logic need not be as tight 
as in Fagin’s Theorem. For example, it is known that a problem is in P (resp. 
PSPACE) iff it can be defined in inductive (resp. partial) fixed-point logic, but 
in both cases only when we work with ordered finite structures (see [6,10]). It 
turns out that on the class of all finite structures there are (computationally 
trivial) problems not definable in these fixed-point logics. Having logical charac- 
terizations of complexity classes on the class of all finite structures is preferable. 
For instance: ordered structures are more difficult to work with when applying 
the inexpressibility tools of finite model theory; in applications of descriptive 
complexity theory, e.g., to databases, it is almost always the case that finite 
structures are not ordered; and it is often undecidable as to whether a formula 
is well-formed when restricted just to ordered finite structures. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 39-48, 2001. 

(c) Springer- Verlag Berlin Heidelberg 2001 
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Whilst it remains unknown as to whether there is a logic capturing P on the 
class of all finite structures, Abiteboul and Vianu proved the following result [2,3]: 
P = PSPACE iff inductive fixed-point logic and partial fixed-point logic are 
equally expressive on the class of all finite structures. It is particularly interesting 
that a complexity-theoretic statement like P = PSPACE has an equivalent 
formulation in logic where there is a such a mismatch between the complexity 
class and the class of problems defined in the respective logic. Abiteboul, Vardi 
and Vianu [1] and Dawar [5] subsequently obtained similar results involving a 
range of complexity classes including P, NP, the Polynomial Hierarchy PH, 
PSPACE and EXPTIME. One thing all of their logics have in common is 
that they can be realised as fragments of bounded- variable infinitary logic 

In obtaining their result, Abiteboul and Vianu developed a model of compu- 
tation known as a loosely coupled generic machine (renamed a relational machine 
in [1]). Abiteboul and Vianu are not the only researchers in recent times to work 
with models of computation taking finite structures as inputs. Perhaps the best 
known such computational model is the abstract state machine of Gurevich [9] . 
Recently, the expressive power of different classes of program schemes has been 
considered. Program schemes are similar in flavour to relational machines and 
abstract state machines, and form a model of computation that is closer to the 
notion of a high-level program than a logical formula is; yet program schemes re- 
main amenable to logical manipulation. In [4,11,12,13], the computational power 
of different classes of program schemes (on the class of all finite structures) is 
compared with the expressive power of certain logics. A crucial difference be- 
tween program schemes and the traditional logics of descriptive complexity the- 
ory is that program schemes can be augmented with ‘computational’ constructs 
not really available in logic, such as stacks and arrays. 

In this paper, we augment a very basic class of program schemes NPS(l) 
with a queue and an extra ‘numeric’ universe. We prove that when both aug- 
mentations are present, the class of problems accepted by the resulting class of 
program schemes NPSQ+(1) is exactly the class of recursively solvable problems, 
i.e., the class of problems TZ£C', and when only the queue is added, the class of 
problems accepted by the resulting class of program schemes NPSQ(l) is exactly 
the class of recursively solvable problems that are closed under extensions, i.e., 
the class of problems TZ£C fl EXT. We proceed to introduce universal quantifi- 
cation into our program schemes by constructing an infinite hierarchy NPSQ of 
classes of program schemes obtained by interleaving the application of universal 
quantification with the constructs of the program schemes of NPSQ(l). It turns 
out that the class of problems accepted by the program schemes of NPSQ has 
an alternative ‘semantic’ characterization as the class of problems defined by the 
sentences of any vectorized Lindstrom logic for which the problem corresponding 
to the operator is recursively solvable and closed under extensions. It follows as 
a corollary that this class of problems has a zero-one law. We remark that by 
[11], there are problems in NPSQ(l)nNP that are not definable in 

The fact that NPSQ+(1) is computationally complete enables us to restrict 
the resources (time and space) used by these program schemes so that we can 
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logically capture many complexity classes ranging from NP through to ELE- 
MENTARY. By looking at the respective classes of program schemes where 
access to the numeric universe is removed, we move from the complexity class 
to its fragment of problems closed under extensions. In doing so, we show that 
many complexity-theoretic inequalities, e.g., NEXPTIME yf 2-NEXPTIME, 
and hypotheses, e.g., NEXPTIME yf EXPSPACE, are equivalent to these 
inequalities and hypotheses for the fragments, e.g., NEXPTIMEfliPAT yf 2- 
NEXPTIMEn^lAr and NEXPTIMEnf AT + EXPSPACEn^lAr; and 
that these fragments can be logically captured (in the sense of Gurevich [8]: 
it is not immediately obvious that a semantically restricted complexity class 
such as NEXPTIMEnfAT can be captured syntactically by a logic). This 
is interesting for the same reason that the equivalence results of Abiteboul 
at al. are interesting, i.e., there exist computationally trivial problems not in 
NEXPTIMEnfAT and EXPSPACEflfAT. Whilst our equivalence results 
are similar to those of Abiteboul et al., they involve different logics: recall, that 
there are problems in NPSQ(l)nNP not definable in C‘^^. Also, whereas the log- 
ics involved in the results due to Abiteboul el al. have a zero-one law (because 
they are fragments of our classes of program schemes define problems 

closed under extensions and so have a ‘one-law’ (apart from the program scheme 
which accepts no finite structures over some fixed signature). Not all proofs are 
included due to space limitations. 

2 Program Schemes with Queues 

Our signatures consist of a (non-empty) finite tuple of relation symbols and 
constant symbols. A problem is an isomorphism-closed class of finite structures 
over some fixed signature. If A and B are both finite cr-structures such that the 
domain of A is contained in the domain of B, i.e., |A| C \B\, and B restricted 
to |A| is A then B is an extension of A and we write A ^ B. li fl \s & problem, 
over the signature cr, for which: A and B are finite cr-structures; A ^ B', and 
A G f?, imply that B G Q then Q is said to be closed under extensions. The 
class of problems that are closed under extensions is EXT. Our structures are 
always finite. 

Program schemes are more ‘computational’ means for defining classes of prob- 
lems than are logical formulae. A program scheme p G NPS(l) involves a finite 
set {xi,X 2 , ■ . ■ , Xk} of variables and is over a signature a. It consists of a finite se- 
quence of instructions where the first instruction is ‘ input {x\,X 2 ,. . Xm ) ’ , for 
some m <k, and the variables of {x\,X 2 , ■ • ■ , Xm} are known as the input-output 
variables, with the remaining variables being the free variables. The remaining 
instructions are one of the following: an assignment instruction of the form Ai 
:= y’, where y is a variable or a constant symbol; a guess instruction of the 
form ‘guess xf', a while instruction of the form ‘while p do ap, a 2 ', . ■ . cxq 
od’, where (p is a quantifier-free first-order formula over a and where each of 
a\,a 2 , . ■ . ,otq is another instruction of one of the forms given here; or an accept 
instruction accept or a reject instruction reject. A program scheme p G NPS(l) 
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over cr with k — m free variables takes expansions A' of cr-structures with k — m 
constants as input and computes on A' in the obvious way except that: execu- 
tion of an instruction ‘guess Xi' non-deterministically assigns an element of \A'\ 
to Xi] initially, every input-output variable is non-deterministically assigned a 
value from \A’\] and if a computation encounters an accept or a reject instruction 
then the computation halts. Note that the value of a free variable never changes 
throughout a computation: the free variables appear as if they were constant 
symbols. The structure A' is accepted by p, and we write A' ^ p, iff there exists 
a computation of p such that an accept instruction is reached. We can easily 
build the usual ‘if’ and ‘if-then-else’ instructions using while instructions: hence- 
forth, we shall assume that these instructions are at our disposal. Note that the 
class of structures accepted by a program scheme of NPS(l) is a problem. 

Whereas a stack and arrays were incorporated into the program schemes of 
NPS(l) in [4,11,12,13], let us now augment these program schemes with access 
to a queue, i.e., a first-in, first-out store. In addition to the above instructions, 
we include the instructions ‘push Xi and Ai := pop’ although in this latter 
instruction we insist that Xi must be an input-output variable, whereas in the 
former it can also be a free variable. The first instruction takes the current value 
of the variable Xi and pushes this value onto the end of the queue (the value 
of Xi does not change and the length of the queue increases by 1), and the 
second takes the current value from the head of the queue and sets xt to have 
this value (with the length of the queue decreasing by 1). Initially, the queue is 
empty and if ever we attempt to pop from an empty queue then that particular 
computation is deemed not to be accepting. We denote the program schemes of 
NPS(l) augmented with a queue by NPSQ(l). 

We can also augment our program schemes with a numeric universe. That is, 
we can assume that the variables of a program scheme are of one of two types: 
variables of the first type, the element type, take values from the universe of the 
input structure (as they have done so far); and variables of the second type, the 
numeric type, take values from the numeric universe, namely {0, 1, . . . , n — 1}, 
where the input structure has size n (we assume that the universe of the input 
structure and the numeric universe are disjoint). We insist that all variables 
of numeric type must be input-output variables and that they are initialised 
to 0. Additionally, there are two constant symbols 0 and max which are always 
interpreted as the numbers 0 and n— 1. The instructions available to the variables 
of numeric type, t\,t 2 , ■ ■ ■ ,tq, say, are assignments of the form := tj\ Hi 
:= O’, Hi := max’ and Hi := tj + 1’ (if the variable tj has value n — 1 then 
execution of this instruction causes that particular computation to reject the 
input); and Hi = tj’. Hi = O’, Hi = max’ and their negations can appear as 
atoms and negated atoms in quantifier-free first-order formulae used as tests 
in while instructions (these formulae might be combinations of atoms involving 
variables of both element and numeric type). The class of program schemes 
NPSQ(l) augmented with a numeric universe is denoted NPSQ+(1). Again, 
the classes of structures accepted by the program schemes of NPSQ+(1) are 
problems. 
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Finite structures are abstract objects and as such generally have no canonical 
representation as a string over {0, 1}, z.e., as a bit-string. Nevertheless, we can 
still arrange for finite structures to be input to Turing machines by encoding 
them as strings (throughout, our Turing machines are non-deterministic). We 
say that a problem fi over cr is accepted by a Turing machine MUM accepts 
exactly those bit-strings encoding structures in 17; and so, in particular, as to 
whether an encoding eg- (A) of any cr-structure A is accepted by M is independent 
of the particular linear order of the elements chosen when encoding A. We can 
easily choose our encoding scheme so that the length of e^iA) is independent 
of the particular linear ordering chosen; and if |7l| = n then we denote this 
length by e„{n). Define TZ£C as the class of problems accepted by some Turing 
machine. Note that not every Turing machine accepts a problem and that TZEC is 
not the class of recursive languages but the class of recursive problems. However, 
any (resp. recursive) language can be realised as a problem over the signature 
cr consisting of the binary relation symbol L and the unary relation symbol B 
(resp. accepted by a Turing machine). A string over {0, 1} can be considered as 
a CT-structure A (in fact, as a class of cr-structures) whose binary relation 
describes a linear order, i.e., the bit positions of the string, and whose unary 
relation details which bits are 0 and which bits are 1. 

Throughout, we identify a class of program schemes with the class of problems 
accepted by those program schemes (the same goes for sentences of logics and 
Turing machines). 

Theorem 1. A problem is accepted by a program scheme of NPSQ_^_{1) iff it 
can be accepted by a Turing machine; that is, NPSQj^{l) = TZSC. 

Proof. Suppose that the problem f2 is accepted by some (non-deterministic) 
Turing machine M. We shall construct a program scheme p of NPSQ_|_(1) which 
simulates M. Without loss of generality, we can assume that: the Turing machine 
M has a two-way infinite work-tape and a read-only input-tape; the work-tape 
alphabet is {0, 1, b}; and the input-tape has a left-hand marker symbol I and a 
right-hand marker symbol r which delimit the input string. 

Essentially, the queue of our program scheme p will hold a description of 
the tapes of the Turing machine. We start with the work-tape. Let u and v be 
distinct elements of an input structure (we can guess these elements as the first 
act of our program scheme) . In our description of the work-tape, we encode the 
work-tape symbol 0 as the triple {u,u,u), the work-tape symbol 1 as the triple 
(v,v,v) and the blank symbol b as the triple (u,u,v); and we use the triples 
(w, u, u) and (v, v, u) as delimiters. Let the work-tape at some instantaneous 
description (ID) of M be of the form w\,W 2 , ■ ■ ■ ,Wi, . . . , Wm, reading from left 
to right where wi (resp. Wm) is the first blank symbol to the left (resp. right) of 
the work-tape head beyond which the work-tape is entirely blank. Furthermore, 
suppose that the work-tape head is scanning the symbol Wi. The queue will 
consist of: the triple encoding wi; the triple encoding W 2 ] ■ ■ ■', the triple {v,u,u) 
(to denote that the head is scanning the next symbol); the triple encoding wp 
. . .; the triple encoding Wm,; and the triple (v,v,u) (to denote that we have 
reached the end of our description of the work-tape). We shall describe the state 
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of the Turing machine M in any ID using a constant number of variables of 
our program scheme p. Also, should our program scheme p know which move 
of the Turing machine M to simulate and the effect upon the work-tape and 
the work-tape head, we could easily simulate this move by making appropriate 
manipulations of the queue. 

Consequently, we are left with the problem of dealing with the input tape. 
This is more complicated as initially our program scheme is not given an encoding 
of the input structure A, which has size n, say: it simply has access to the raw 
finite structure itself. However, we can build our own encoding of A. Prior to 
any simulation of the Turing machine M, our program scheme guesses a linear 
ordering on |A| and stores this linear ordering on the queue. Note that because 
we have access to the (disjoint) domain {0, 1, . . . , n — 1}, the capacity to count 
using the elements of this domain and the integers 0 and n — 1 (through the 
constants 0 and max), we can ensure that our guessed linear order contains 
every element of |A| exactly once. 

Suppose that our guessed linear order is uq,u\,. . . ,u„-i. This linear order 
defines a concrete encoding of our input structure and it is this encoding which 
we assume is presented to the Turing machine M. We use our linear order to 
write a description of the input-tape of M on our queue. 

It is now straightforward, given descriptions of the input-tape and the work- 
tape on the queue, to simulate the computation of M . The converse, that any 
problem in NPSQ^(l) is in TZSC, is trivial. □ 

The question remains as to what sort of problems can be accepted by program 
schemes of NPSQ(l). 

Theorem 2. A problem is accepted by a program scheme of NPSQ{1) iff it 
can be accepted by a Turing machine and is closed under extensions] that is, 
NPSQ{1) = nSCASXT. 

Proof. Let A and B be cr-structures such that AQB.li A\s accepted by some 
program scheme p of NPSQ(l) then we can mirror an accepting computation in 
a computation of p on (essentially, because our tests in while instructions are 
quantifier- free) . Hence, NPSQ(l) C TZSC C\SXT- 

Conversely, suppose that 17 is a problem, over the signature cr, in TZSCdSXT- 
In particular, 17 is accepted by some Turing machine M. Construct a program 
scheme p' of NPSQ(l) as in the proof of Theorem 1 except that when the linear 
order is guessed, the checks made are that every element appears in the linear 
order at most once and that every constant appears in the linear order. Whenever 
p' makes a guess, include code to ensure that the guess is always an element in 
the domain of the linear order. 

Suppose that the cr-structure A is in 17; that is, every encoding of A is 
accepted by M. Then there is a computation of p' on input A which guesses a 
linear order whose domain is the whole of |7l|; and as a result, this computation 
of p' on input A simulates that of M on the respective encoding of A. Thus, A 
is accepted by p' . 




Program Schemes, Queues, the Recursive Spectrum and Zero-One Laws 



45 



Suppose that the cr-structure A is accepted by p'. Then there is an accepting 
computation where this computation guesses a linear order whose elements come 
from the subset B of |^|. Let B be the restriction of ^ to i? (and so A is 
an extension of B). The accepting computation of p' on input A simulates an 
accepting computation of M on input the respective encoding of B (because any 
guess results in an element of B). Hence, this encoding of B is accepted by M 
and thus B G fl. But 17 is closed under extensions and so A G fi. The result 
follows. □ 

3 Zero-One Laws 

We can extend our class of program schemes NPSQ(l) to a hierarchy of classes of 
program schemes essentially by interleaving applications of universal quantifiers 
with the basic constructs of the program schemes of NPSQ(l). In more detail, 
assume that we have defined a class of program schemes NPSQ(2m — 1), for 
some TO > 1, and that any program scheme has associated with it: a set of 
input-output variables; a set of free variables; and a set of bound variables. 

Definition 1. Let the program scheme p G NPSQ(2m — 1) be over the sig- 
nature a, and suppose that p has: input-output variables X\,X 2 , ■ ■ ■ ,Xk; free 
variables Xk+i,Xk+ 2 , ■ ■ ■ ,Xk-i-s; and bound variables Xk+s+i,Xk-\-s-\- 2 , ■ ■ ■ ,Xk+s-i-t- 
Let Xi^, Xi^, . . . , Xi^ be free variables of p. Then 'ixifixi^ . . . 'ixi^p is a program 
scheme of NPSQ{2m), which we denote by p' , with: no input-output variables; 
free variables those of {xk+i,Xk+ 2 T ■ ■ ,Xk+s} \ {xi^Xi^, ■ ■ ■ ,Xi^}; and the re- 
maining variables of {xi,X 2 , ■ ■ ■ ,Xk+s+t} as its bound variables. 

A program scheme such as p' takes expansions A' of a -structures A by ad- 
joining s — p constants as input (one for each free variable), and p' accepts such 
an expansion A' iff for every expansion A” of A' by p additional constants (one 
for each variable Xi., for j G {1,2, ... ,p}J, A" |= p (the computation on such 
an expansion A” always starts with the queue empty). □ 

Definition 2. A program scheme p' G NPSQ{2m—l), for some to > 2, over the 
signature a, is defined exactly as is a program scheme of NPSQ{1) except that the 
test in any while instruction is a program scheme p G NPSQ(2m — 2). The bound 
variables of p' consist of the bound variables of any test in any while instruction; 
all free variables in any test in any while instruction are input-output or free 
variables of p' ; and there may be other free and input-output variables (appearing 
in p' at the ‘top level’ but not in any test). Of course, any free variable never 
appears on the left-hand side of an assignment instruction, in a guess instruction 
or in a pop instruction in p' . 

Suppose that a program scheme p' G NPSQ{2m — 1) has s free variables. 
Then it takes expansions A' of a-structures A by adjoining s constants as input 
and computes on A' in the obvious way; except that when some while instruction 
is encountered, the test, a program scheme p G NPSQ {2m — 2), is evaluated 
according to the expansion of A' by the current values of any relevant input- 
output variables of p' . Ln order to evaluate this test, the queue associated with p 
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is initialised as empty and when the test has been evaluated the computation of 
p' resumes accordingly with its queue and input-output and free variables being 
as they were immediately prior to evaluation. □ 

So, we obtain a hierarchy NPSQ(l) C NPSQ(2) C ... C U{NPSQ(i) : i = 
1,2,...} = NPSQ. Similar hierarchies have been defined previously in [4,11,13] 
but based around stacks and arrays and not queues. We omit the proofs of the 
following two results but remark that the proof of the latter result follows from 
the proof of the former. 

Theorem 3. The class of program schemes NPSQ has a zero-one law. □ 

Then the logic formed by extending first-order logic FO with a vectorized 
sequence of Lindstrom quantifiers corresponding to the problem 17 is denoted 
(±l7)*[FOj: see [6] for definitions. Such a logic is a vectorized Lindstrom logic. 

Corollary 1. NPSQ = 1J{(±17)*[F0] : Q G TISC n EXT}. □ 

Essentially, Corollary 1 equates the ‘syntactically-defined’ class of problems 
NPSQ with the ‘semantically-defined’ class of problems definable using first- 
order constructs in tandem with ‘recursive extension-closed’ operators. Whilst 
NPSQ is a natural sub-class of a computationally complete class of program 
schemes, there are problems in TZEC which have a zero-one law but which are 
not in NPSQ, one such being the ‘Hamiltonian cycle’ problem. 

4 A Simple Application 

It is a question of great importance in finite model theory and database theory 
as to whether there is a logic which captures exactly the class of polynomial- 
time solvable problems. Of course, one has to be precise about what one means 
by a logic and Gurevich [8] has formulated a definition which has been widely 
adopted. A logic L is given by a pair of functions {Sen, Sat) satisfying the 
following conditions. The function Sen associates with every signature a a re- 
cursive set Sen{a) whose elements are called L-sentences over a. The function 
Sat associates with every signature a recursive relation Sato-{A,(p), where A is 
a cr-structure and v? is a sentence of L. We say that A satisfies (p if Sat^{A, p) 
holds. Furthermore, we require that Sata{A,(p) iff Sata-{B,(p) when A and B 
are isomorphic cr-structures. As yet no-one has exhibited a logic (in Gurevich’s 
sense) to capture any mainstream complexity class (for which the expectation 
is that it is properly) contained in NP. Logical characterizations of complexity 
classes on the class of ordered finite structures usually suffer from the difficulty 
that deciding whether a formula of the logic is ‘order invariant’ is undecidable. It 
is straightforward to verify that the classes of program schemes NPSQ (together 
with the classes therein) and NPSQ+(1) are logics in the sense of Gurevich; 
and so, for example. Theorem 2 implies that the class of recursively-solvable 
problems that are closed under extensions can be logically captured. However, 
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by imposing resource conditions upon our program schemes, we can logically 
capture other classes of problems. 

A problem 17, over some signature a, is accepted by a non-deterministic Tur- 
ing machine M in time f{x) (resp. using space g{x)) if M accepts 17 and for 
every encoding ecr{A) of any a-structure A of size n in 17, there is an accepting 
computation of M on input e^iA) taking time at most /(ecr(n)) (resp. using 
space at most g^e^in))). In contrast, the time and space complexities of a pro- 
gram scheme of NPSQ+(1) or NPSQ(l) on some input structure are measured 
in terms of the size of the input structure. In order to evaluate the time taken 
or the space used in some computation of a program scheme of NPSQ+(1) or 
NPSQ(l) on some input structure of size n, we treat the execution of every as- 
signment, quantifier-free test, pop, push, etc., as taking one unit of time, and 
we regard a variable or a place on the queue as occupying one unit of space. As 
for a non-deterministic Turing machine, the complexity of a program scheme of 
NPSQ+(1) or NPSQ(l) is derived using accepting computations only. 

For any function f {x), let VOCy{f{x)) denote {po(a^)-(/(<Zo(a^)))^ +Pi(x). 
{f{qi{x)))''~'^ + ---+Pk-i{x).f{qk-i{x))+Pk{x) : k > 0,po,pi, . . . ,pk,qo,qi, ■ ■ ■ , 
Qk-i are polynomials with integer coefficients}. If C is a class of functions then 
VOCy{C) = U/(a;)eC 'k’^t^y{f{x)). Denote the class of non-deterministic Turing 
machines that accept problems in time (resp. using space) f{x) where f{x) is 
a function from the class of functions C as NTM*(C) (resp. NTM®(C)), with 
NPSQ*(1)(C) and NPSQ(,_(1)(C) (resp. NPSQ"(1)(C) and NPSQ;(1)(C)) defined 
likewise. The proofs of Theorems 1 and 2 yield the following. 

Corollary 2. For any class of functions C: 

- NTM\VOCy{C)) = NPSQ\{l){VOCy{C)) 

- NTM\VOCy{C))C^£XT = NPSQ\l){VOCy{C)) 

- NTM%VOCy{C)) = NPSQl{l){VOCy{C)) 

- NTM^ro£y{c)) r\ £xr = NPSQ‘^{i){vo/:y{c)) . □ 

We can regard NPSQ*(l)(PO£3^(C)), for example, as a logic (in Gurevich’s 
sense) by identifying program schemes from this class with pairs (p, F(x)), where 
p is a program scheme of NPSQ(l) and F{x) is a function from PO£y{C). Of 
course, this requires that the functions of C form a recursive set and that there 
is a Turing machine which when given a function f{x) from C and a positive 
integer n, computes the value /(n). Thus, Corollary 2 can be used to logically 
capture a whole host of complexity classes including NP, P SPACE, NEX- 
PTIME, EXPSPACE, 2-NEXPTIME, 2-EXPSPACE, . . ., ELEMEN- 
TARY. However, Corollary 2 can also be used to logically characterize ‘frag- 
ments’ of (the above) complexity classes obtained by intersecting the complexity 
class with the class of problems SXT. Note that this is a ‘semantic’ rather than 
a ‘syntactic’ restriction and so it is not immediately obvious as to whether such 
a fragment of a complexity class can be logically captured. Furthermore, existing 
complexity-theoretic inequalities or hypotheses can be used to derive analogous 
ones for the respective fragments of the complexity classes in question. 
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Applying Corollary 2, we know that NEXPTIME ^ 2-NEXPTIME. 
Corollary 2 implies that NEXPTIME n EXT + 2-NEXPTIME n EXT and 
that the class of problems accepted by program schemes of NPSQ(l) restricted 
to run in time 2^*’*' \ for some polynomial p{x), properly contains the class of 
problems accepted by program schemes of NPSQ(l) restricted to run in time 
2P(^)^ for some polynomial p{x). Also, it is open as to whether NEXPTIME = 
EXPSPACE, which, by Corollary 2, is equivalent to whether NEXPTIME fi 
EXT = EXPSPACE nfAT and to whether the class of problems accepted by 
program schemes of NPSQ(l) restricted to run in time for some polyno- 

mial p{x), is the same as the class of problems accepted by program schemes of 
NPSQ(l) restricted to run using space for some polynomial p{x). 

We feel that it is interesting that open complexity-theoretic questions are 
equivalent to analogous logical questions involving classes of problems with zero- 
one laws, and even closed under extensions. Of course, these latter classes of 
problems do not contain some computationally-trivial problems, yet still they 
have a definitive role to play in the question of whether two complexity classes, 
such as NEXPTIME and EXPSPACE which have significant complexity- 
theoretic capacities, are identical. 
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Abstract. Karp and Lipton, in their seminal 1980 paper, introduced the 
notion of advice (nonuniform) complexity, which since has been of central 
importance in complexity theory. Nonetheless, much remains unknown 
about the optimal advice complexity of classes having polynomial advice 
complexity. 

In particular, let P-sel denote the class of all P-selective sets [23] For 
the nondeterministic advice complexity of P-sel, linear upper and lower 
bounds are known [10]. However, for the deterministic advice complex- 
ity of P-sel, the best known upper bound is quadratic ]13], and the best 
known lower bound is the linear lower bound inherited from the nonde- 
terministic case. This paper establishes an algebraic sufficient condition 
for P-sel to have a linear upper bound: If all P-selective sets are asso- 
ciatively P-selective then the deterministic advice complexity of P-sel is 
linear. (The weakest previously known sufficient condition was P = NP.) 
Relatedly, we prove that every associatively P-selective set is commuta- 
tively, associatively P-selective. 



1 Introduction 

Selman [23,24,25,26] defined the P-selective sets about twenty years ago. In addi- 
tion to being of interest in their own right, they have recently had some surprising 
applications. For example, selectivity is a powerful tool in the study of search ver- 
sus decision problems [4] , and nondeterministic generalizations of selectivity are 
the key tools used to show that even NP machines cannot uniquely refine satisfy- 
ing assignments unless the polynomial hierarchy collapses [8], that even weaker 
refinements are also precluded unless the polynomial hierarchy collapses [21,18], 
and that many cardinality types of nondeterministic function classes cannot col- 
lapse unless the polynomial hierarchy collapses [9]. 
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Definition 1 ([23]). A set B is B -selective if and only if there is a (total) 
polynomial-time function / : 17* x ^ Z’* such that 

1- {^x,y)[f{x,y)=xy f{x,y)=y\. 

{'^x,y)[{x & By y & B) f{x,y) & B]. 

We call such an f a F -selector function for B. 

That is, a set B is P-selective if there is a polynomial-time function / that 
given any two strings always chooses one of them, and / does this in such a way 
that if exactly one of the strings is in B then the P-selector function chooses 
that string. f{x,y) is often described, very informally, as choosing one of a; or y 
that is “more likely (actually, no less likely) to be in the set.” 

The P-selective sets have been extensively studied, and much about them 
is well understood. Though some P-selective sets are very complex — highly 
undecidable — the P-selective sets nonetheless have a broad range of structural 
simplicity properties. Most crucially in terms of the study in this paper, Ko [13] 
showed that they have low nonuniform complexity (P-sel C P/0(n^)). A few 
among the many other simplicity results known to hold are: Ko and Schoning [15] 
showed that all P-selective sets in NP are in the second level of the low hierar- 
chy of Schoning [22], and Allender and Hemachandra [1] showed that the Ko- 
Schoning result is the strongest lowness result for P-sel that holds with respect to 
all oracles; a long line of work starting with Selman [23] and Toda [28] has shown 
that no P-selective set can be NP-hard under <[]j or various other reductions 
unless P = NP; Naik and Selman [19] have shown that no P-selective set can be 
truth-table-hard for NP unless certain (intuitively unlikely) containments hold 
in the relationship between adaptive and nonadaptive queries to NP; and as a 
consequence of the work of Ko [13] and Kobler and Watanabe [16], no P-selective 
set can be truth-table-hard for NP unless the polynomial hierarchy collapses to 
ZPpNP, where ZPP as usual denotes expected polynomial time. Though much 
is known about the P-selective sets and their refinements and generalizations, 
including quite current work (e.g., [20]), much remains open. 

In this paper, we show that P-selector functions with nice algebraic properties 
have simplicity properties far beyond those known to hold for general P-selective 
sets. More generally, we study the class of languages one obtains from P-selector 
functions having algebraic properties, and the possibility of obtaining such alge- 
braically nice P-selector functions. 

In particular, Section 3 shows that any P-selective set having an associative 
P-selector function also has a commutative, associative P-selector function. It 
follows that sets having an associative P-selector can be accepted by determin- 
istic advice interpreters using only a linear amount of advice. In contrast, the 
best upper bound on the deterministic advice complexity of the P-selective sets 
is the quadratic bound obtained by Ko [13]. 

Our result provides a new sufficient condition — all F-selective sets are asso- 
ciatively F-selective — for all F-selective sets having linear deterministic advice; 
the weakest previously known sufficient condition was the extremely demanding 
assumption that P = NP. 
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Section 4 shows that associatively P-selective sets with weak census functions 
can never be coNP-immune. Section 5 establishes a structural sufficient condition 
for all P-selective sets being associatively, commutatively P-selective. 

2 Definitions 

A binary function / is associative if and only if, for all a, b, and c, it holds that 
/(a, f{b, c)) = b),c). A binary function / is associative at each length if, for 

all a, b, and c satisfying |a| = |6| = |c|, it holds that /(a, f{b, c)) = /(/(o, b),c). A 
binary function / is commutative if and only if, for all a and b, f{a, b) = f{b, a). 
We will use the following notational shorthands for the classes that we will study. 
(The ordering of “A” and “C” changes to avoid confusion with the existing class 
“AC,” and to make clear that the modifies just the “A.”) 

Definition 2. 1. P-sel = {B \ B is V -selective}. 

2. A-sel = {B I B is V-selective via an associative V- selector function}. 

3. C-sel = {B I B is P-selective via a commutative P -selector function} . 

4 . CA-sel = {B I B is P-selective via a commutative, associative P-selector 
function} . 

5. A^-sel = {B I B is P-selective via a P-selector function that is associative 
at each length}. 

6. A^C-sel = {B I B is P-selective via a commutative P-selector function that 
is associative at each length}. 

From these definitions, CA-sel C A-sel C A^-sel C P-sel, CA-sel C A^C-sel C 
Af-sel C P-sel, and A^C-sel C C-sel C P-sel. The following fact is well-known. 

Fact 1 P-sel = C-sel. 

The fact holds as if B is P-selective via P-selector function /, then f{x, y) = 
max(/(x, y), f(y, x)) is a commutative function that remains a P-selector for B. 

Advice classes capture the information content of sets with respect to some 
class of decoder (usually P) . 

Definition 3 ([12]). 

1. Let C be any complexity class and f be any function mapping from N to N. 
A set B is in Cj f(n) if and only if there exist a set C € C and a function 
h : 1* ^ E* such that for all x £ S* , 

(a) |/i(fl^l)| = /(|x|), and 

(b) x£B ^ (x,/i(fl^l)) G C. 

2. For any complexity class C and any function class T,CjT = [J C/ f(n). 

In this paper, we will be particularly interested in P jO(r?'), P/0(n), P /n-\-l, 
and P/n. 

Let T be any function complexity class. A set B is iF-printable if there is 
a function g £ T such that, for all x, g{x) outputs (in some fixed, standard 
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way of encoding sets as strings) B fl U U • • • U [3]. That is, there 
is a function from that finds all elements in the set up to a given length. By 
tradition, P-printable denotes the FP-printable sets. 

As is standard, FP is the class of all (total) polynomial-time computable func- 
tions. FP'^^ denotes the class of all functions that are computable in polynomial- 
time with the help of an NP oracle. 

3 Associativity Drops Deterministic Advice Complexity 
to Linear 

Are all associatively P-selective sets commutatively, associatively P-selective? 
If they are, this will not only collapse two of our classes, but also will allow 
us to use results from tournament theory in our study of associative selector 
functions (since, as we will discuss later, commutative selector functions have 
a close relationship to tournaments). Unfortunately, Fact 1 does not say that 
A-sel = CA-sel. The reason it does not guarantee this is that it is conceptually 
possible that the transformation from / to /' that is stated just after Fact 1 
will, while gaining commutativity, destroy associativity. The following theorem, 
our main result, states that this is never the case. (All proofs here omitted due 
to space can be found in [6].) 

Theorem 2. A-sel = CA-sel. 



Theorem 3. A^-sel = A^C-sel. 

Ko showed the following result. 

Theorem 4 ([13]). P-sel C P/0{n^). 

That is, deterministic advice interpreters given quadratic advice accept the 
P-selective sets. It is also known that nondeterministic advice interpreters with 
linear advice accept the P-selective sets, and this is optimal. 

Theorem 5 ([10]). P-sel C NP/n-|-l but P-sel 2 NP/n. 

It is natural to wonder whether those results can be unified into the very 
strong claim: P-sel C P/0(n). Currently, no proof of this claim is known and 
people doing research on selectivity generally doubt that it holds. However, prov- 
ing the claim false seems unlikely in the immediate future, as by Theorem 5 any 
such proof would implicitly prove P yf NP. 

Nonetheless, the following result shows that all associatively P-selective sets 
are accepted by deterministic advice interpreters with linear advice. 

Theorem 6. A^-sel C P/n-|-l. 



Corollary 1. A-sel C P/n-|-l. 
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Proof of Theorem 6: Let B G A^-sel. Let / be a commutative P-selector 
for B that is associative at each length. By Theorem 3 we know that such a 
commutative P-selector for B must exist. 

Let n e N. Consider the directed graph G„ = where = 

{{x,y) I x,y G A x ^ y A f{x,y) = y}. G„ is a tournament — a directed 
graph such that between any two nodes u, u, u ^ v, there is exactly one of the 
edges (u,v) or (v,u). 

En is a transitive relation. To see this, let x,y,z G i?^”. Suppose {x, y) G i?„ 
and (y, z) G E„. Note that this implies x ^ y, y ^ z, and x ^ z. It also implies 
that f{x,y) = y and f{y,z) = z. Thus, f{f{x,y),z) = z and f{x,f{y,z)) = 
f{x, z). Since / is associative at each length we have f{f{x, y),z) = f{x, f{y, z)) 
and so f{x, z) = z, which implies (x, z) G En- Thus, E„ is transitive. 

Tournaments such that their edge set is a transitive relation are called tran- 
sitive tournaments. We will use the following result (which can be viewed as a 
tournament-language expression of the fact that each finite linear order has a 
minimal element). 

Proposition 1 ([17]). The following statements are equivalent for tournaments 
G={V,E): 

1. G is a transitive tournament. 

2. G contains no directed cycles. 

3. Every induced (directed) subgraph G' = (V',E'), V' yf 0, of G contains a 

(distance-one) source node, i.e., a node u G V such that for all v G V , 

V ^ u, (u, v) G E' . 

Since / is associative at each length, we have for all n G N that Gn is a 
transitive tournament, which in turn by Proposition 1 implies that (if G„ is not 
the empty tournament) G„ contains a source node s„. In other words, for all 
n G N, if yf 0 then there exists a string s„ G such that for all x G 
f{x, Sn) = X. Since s„ G i?^", clearly for each length n string z G B we have 
f (-^j "^n) ^n- 

Define, for all n G N, /i(l") = ls„ if yf 0, and ft.(l") = 0"+^ otherwise. 
Note that h on any input 1” outputs a string of length exactly n -I- 1. Let 
L = {(a;,ly) | |a;| = |y| A f{x,y) = x}. Clearly, L G P. Due to the above 
construction we also have, for all x G S* , 

xGB ^ (a;,/i(ll^l)) G L. 

This shows that B G P/n-|-l. 

It remains to show that P/n-|-I — A^-sel yf 0. In fact, it clearly holds even that 
P/1 — P-sel yf 0. The proof of this claim consists of a diagonalization against all 
P-selectors. Let /i,/ 2 ,--- be an enumeration (not necessarily effective, though 
in fact there do exist such effective enumerations) of all polynomial-time selector 
functions (i.e., all polynomial-time functions such that for all x and y it holds 
that f{x,y) G {x,y}). Set B = | /i(0^*“^, 0^*“^) = 0^*“^} U | 

0^*“^) = 0^*“^}. B is not P-selective, yet B G P/1. □ 
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Regarding the previous result, note that, more generally, if a set A and a 
(not necessarily commutative) P-selector / for it have the property that at each 
nonempty length m there is some string y in the set at that length — or even at 
some length linearly related to that length — such that f{y, z) = z or f{z, y) = z 
for each z of length m, then A is in P/0(n). However, among ways of ensuring 
that this condition is met, we feel that associativity is a particularly natural and 
well-structured property to study. 

The number of advice bits in Theorem 6 and Corollary 1, n -I- 1, is optimal. 
Theorem 7. A-sel jn. 

Corollary 2. A^-sel % P/n. 

In fact, for any recursive function /, A-sel 2 DTIME[/(n)]/n. The reason 
for this is that we may take the P-selective set that Hemaspaandra and Toren- 
vliet [10] prove is not in DTIME[/(n)]/n and, just as in the proof of Theorem 7, 
may complete its P-selector function in a manner that achieves associativity. 

We mention in passing that — since log2((2”+^ — 1)-|-1) = n-|-l and thus one 
can include in the advice-seeking tournament all lengths up to the current one — 
the n -|- 1 advice bounds in Theorem 6 and Corollary 1 hold even in the “strong 
advice” model of Ko, Balcazar, and Schoning [14,2] (see also the discussion 
in [10]). 

We note that our results relativize. In particular, one can relativize them by 
any particular NPflcoNP set, and so regarding associativity and the NP-selective 
sets (equivalently the NPSVj-selective sets and equivalently the ppNPncoNP_ 
selective sets, see [8,7] for definitions and discussion), it follows from our results 
that all length-associatively NP-selective sets are in (NP ricoNP)/n-|- 1. In fact, 
essentially all of this paper’s results can be stated and proven more generally so 
as to apply even to a variety of nondeterministic selectivity classes; see [5] for 
full details. 

4 Printability and Nonimmunity 

Associativity yields additional simplicity properties. Let us consider nonimmu- 
nity results (i.e., presence of infinite subsets). For general P-selective sets. Theo- 
rem 8 holds. In contrast, for associatively selective sets, we have Theorem 9. To 
see how these bounds relate, note that UP'^^ C NP^^. 

o^tvjtdNP 

Theorem 8. Every infinite ^-selective set B has an infinite FP ® -print- 
able subset. 

Theorem 9. Every infinite A-selective (or even Ai-selective) set B has an in- 
finite FP®®UP -printable subset. 

Theorem 8 should be contrasted with the result of Hemaspaandra, Ogi- 
hara, Zaki, and Zimand (personal communication, 1999) that P-sel is not (weak- 
ppNP -rankable)-immune, a result related to the study in [1 1] . 
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Proof of Theorem 9: Let B be an infinite A^-selective set. Let / be a com- 

mutative P-selector of B that is associative at each length. By Theorem 3 we 
know that such a selector for B exists. 

Consider the function score that is defined as score{x) = \\{z G | f{x, z) 
= x}||. Note that for each n G N and for each x G A", (a) x ^ B score{x) < 
2” — ||i?^"|| and (h) x G B score(x) > 2” — ||i?^”|| -I- 1. So for all x G i? 
and all y ^ i? such that |x| = \y\, score (y) < score (x). It follows from the 
associativity-at-each-length of / that for all n, max{score{y) \ \y\ = n} = 2” (see 
also Proposition 1 and the comment immediately after it, applying both to the 
tournament on 27" defined by the associative-at-each-length selector function /) . 
So for each n, there exists exactly one string at length n, call it d„, at length n, 
such that score(dn) = max{score{y) \ |j/| = n} = 2". Thus, the set 

F ={{!"', z) I n G N A 2 : G 27* A (3w : |w| = n — |z|)[score(za;) = 2"]} 

is in UP^^. So, given 1", computing dn can be done in polynomial time with 
the help of the oracle F. Let C = B D {di \ i > 0}. Clearly, as B is infinite, and 
C is both infinite and -printable. □ 

If one cares about the number of queries to B needed in the printability 
claims of Theorems 8 and 9 the following can be observed. As communicated to 
us by Till Tantau [27], one can limit one’s queries to B to a logarithmic number 
via putting the queries into a Toda-like ordering and binary searching to find 
which are in and which are out. 

It follows from the proof of Theorem 9 that any associatively selective set for 
which there is an infinite set of lengths at which we know it is not empty is in 
fact non-coNP-immune. This is a very weak type of partial census information. 

Definition 4. We say a set B is hintable if there is a tally set, TCI*, such 
that T gV and (Vi)[P G T ^ ||B=*|| ^ 0]. 



Theorem 10. Any infinite, hintable A-selective (or even Ai-selective) set has 
an infinite coNP subset (i.e., A^-sel fl Hintable is not colTP -immune) . 

5 Are All P-Selective Sets Associatively P-Selective? 

Many of our results give simplicity properties of A-sel. It is natural to wonder 
whether in fact A-sel is all of P-sel. We will prove, as Corollary 3, that if P = NP, 
then these classes coincide. 

Theorem 11. Every F-selective set has a selector function in FP'^^ that is 
commutative and associative. 

Proof: Let A be a P-selective set. Let / be a P-selector of A, without loss of 
generality assume / to be commutative. For every pair of strings x and y, lo is 
called a connector of x and y if and only if either (a) f{x, to) = to and /(w, y) = y 
or (b) /(cc,a>) = x and f{ui,y) = uj. Note that for any two strings x and y, there 
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always exist connectors of x and y, namely x and y. Define the function g by 
setting, for all x and y, 

{ y if the lexicographically smallest connector w of x and y 
yields /(x,w) = u and f(uj,y) = y, 

X otherwise. 

It is not hard to see that g is an function. We will now argue that 

(/ is a commutative and associative selector for A. By definition, g is clearly 
commutative. 

We will show that g is a selector for A by showing that for any two strings x 
and y, g{x,y) satisfies the conditions (1) g{x,y) = x V g{x,y) = y and (2) (x G 
AV y £ A) g{x, y) £ A (see Definition 1). Let x and y be any two strings. 
Condition (1) holds by definition oi g.li x = y or /(x,y) = g{x,y) then also 
(2) is trivially satisfied. So suppose x ^ y and f{x,y) ^ g{x,y). Without loss 
of generality assume f{x,y) = x and g{x,y) = y. Hence the (lexicographically) 
smallest connector of x and y, call it to, yields f{x,uj) = oj and f{uj,y) = y. 
Since /(x, y) = x, /(x, to) = to, and /(w, y) = y it follows from the fact that 
/ is a selector of A that either x,y,u> £ A or x,y,LO ^ A. Hence (2), i.e., 
{x £ Ay y £ A) g{x, y) G A, holds. 

It remains to show that g is an associative function. Assume that g is 
not associative. Hence there exist three disjoint strings x, y, and z such that 
g{x,g{y,z)) yf g{g{x,y), z). (Note that it x = y or y = z or x = z then 
g{x, g{y, z)) = g(g(x, y),z).) It is not hard to see that g(x, g{y, z)) yf g{g{x, y),z) 
is equivalent to 

{g{x, y) = y A g{y, z) = z A g{x, z) = x)V 

{g{x, y)=x A g{y, z) = y A g{x, z) = z) (*). 

Informally put, g{x, g{y, z)) y^ g{g{x,y),z) is equivalent to the statement that 
X, y, and z induce a directed cycle of length three in the (infinite) tournament 
induced by g. Let ui be the lexicographically smallest connector among all con- 
nectors of X and y, among all connectors of y and z, and among all connectors 
of X and z. Let a,b £ {x,y,z}, a yf 6, be such that w is a smallest connector 
of a and b. Without loss of generality assume g{a,b) = b. Hence f{a,oj) = oj 
and f{uj, b) = b. Let c be the string in {x, y, z} \ {a, b}. If f(c, uj) = c then co is 
also the (lexicographically) smallest connector of a and c and thus g{a, c) = c, 
contradicting (*). If /(c, w) = lo then uj is also the (lexicographically) smallest 
connector of b and c and thus g{b,c) = b, contradicting (*). This shows that g 
is associative and completes our proof. □ 

Corollary 3. P = NP P-sel = A-sel. 

Theorem 6 provides a sufficient condition, based on an algebraic property for 
selector functions, for P-sel C P/0(n). If one were interested only in structural 
complexity-class-collapse sufficient conditions. Theorem 6 would be no improve- 
ment over the P = NP sufficient condition implicit in the result of Hemaspaandra 
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and Torenvliet that P-sel C NP/0(n) (and thus P-sel C V /0{n) if P = NP), as 
in light of Corollary 3 the best known structural complexity-class-collapse con- 
dition sufficient to imply P-sel = A-sel is also the collapse P = NP. However, we 
feel that this is the wrong view, and that “P-sel = A-sel” is probably a funda- 
mentally different type of assumption than P = NP. For example, if P-sel = A-sel 
were in fact equivalent to P = NP, then P = NP would (trivially) be not just a 
sufficient condition for P-sel = A-sel but also would be a necessary condition. In 
fact, not only is P = NP not known to be necessary for P-sel = A-sel, but in fact 
no structural complexity-class-collapse condition — not even very weak collapses 
like P = UP or P = ZPP — is known to be necessary. Our point here is that, 
though by Corollary 3 P = NP is one way to achieve P-sel = A-sel, we conjecture 
that it is unlikely to characterize P-sel = A-sel. And thus, our P-sel = A-sel suf- 
ficient condition for P-sel C P/0(n) is best viewed as a new algebraic sufficient 
condition quite different from the known (and extremely demanding) structural 
complexity-class-collapse sufficient conditions. 

Finally, we mention two open issues. First, can one prove a complete or 
partial converse of Corollary 3? The second issue is the following. One would 
ultimately like to know whether all P-selective sets have linear deterministic 
advice, i.e., whether P-sel C V /0{n). This paper gave a new sufficient condition 
for that, namely, P-sel C P/0(n) if all P-selective sets have associative (or even 
merely length-associative) selector functions. On the other hand, the task of 
constructing an oracle relative to which P-sel C P/0(n) fails has resisted any 
progress for half a decade. This paper suggests a weaker question that can be 
tackled first: trying to construct an oracle relative to which some P-selective sets 
have no length-associative selector functions. 
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Abstract. P-complete problems seem to have no parallel algorithm 
which runs in polylogarithmic time using a polynomial number of proces- 
sors. A P-complete problem is in class EP (Efficient and Polynomially 
fast) if and only if there exists a cost optimal algorithm to solve it in 
T{n) = 0(t(n)') (e < 1) using P(n) processors such that T(n) x P(n) = 
0(t(n)), where t{n) is the time complexity of the fastest sequential al- 
gorithm which solves the problem. The goal of our research is to find 
EP parallel algorithms for P-complete problems. In this paper we con- 
sider two P-complete geometric problems in the plane. First we consider 
the convex layers problem of a set S' of n points. Let k be the num- 
ber of the convex layers of S. When 1 < k < (0<e<l) we can 

find the convex layers of S in time using p processors, where 

l-e 

1 < p < n 2 . Next, we consider the envelope layers problem of a set 
S of n line segments. Let k be the number of the envelope layers of S. 
When l<fe<n5 (0<e<l), we propose an algorithm for computing 
the envelope layers of S in time using p processors, where 

1 < p < n~s ~ , and a(n) is the functional inverse of Ackermann’s function 
which grows extremely slowly. The computational model we use in this 
paper is the EREW-PRAM. Our first algorithm, for the convex layers 
problem, belongs to EP, and the second one, for the envelope layers 
problem, belongs to the class EP if a small factor of log n is ignored. 



1 Introduction 

In parallel computational theory, one of the primary measures of parallel com- 
plexity is the class NC. A problem is said to be in iVC if there exists an algorithm 
which solves the problem in polylogarithmic time using polynominal number of 
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processors. The class of P-complete problems consists of the most likely can- 
didates of P that are not in NC. However, polylogarithmic time complexity 
is not so important when considering practical parallel computation. Actually, 
the number of processors available is usually small in comparison with the size 
of a problem. Thus, in practice cost optimality turns to be the most impor- 
tant measure for parallel algorithms. The cost of a parallel algorithm is defined 
as the product of the running time and the number of processors required by 
the algorithm. A parallel algorithm is called cost optimal if its cost is of the 
same order as the time complexity of the fastest known sequential algorithm 
for the same problem. Kruskal et al. [8] proposed class EP (Efficient and Poly- 
nomially fast). A problem is in EP if and only if there exists a cost optimal 
algorithm to solve it in T{n) = 0{t{nY) (e < 1) using P(n) processors such 
that T(n) x P(n) = 0(t(n)), where t(n) is the time complexity of the fastest 
sequential algorithm which solves the problem. 

In this paper, we consider EP algorithms for two P-complete geometric prob- 
lems, the convex layers problem and the envelope layers problem on EREW- 
PRAM. Given a set S' of n points in the plane, S can be divided into a sequence 
CL{S) of convex hulls: C'Li(S), CL 2 (S), . . .,CLk{S), where CL^S) {I < i < k) 
is the convex hull of The envelope layers of a set S of (opaque) n 

line segments are analogous to the convex layers of a set of points, with convex 
hulls replaced by upper envelopes. The upper envelope of S is the collection of 
segment portions visible from the point (0,-|-oo), which consists of 0{na{n)) 
subsegments of S. These two problems are P-complete [4,6]. They are still P- 
complete even if I < k < . 

Chazelle [1] proposed an optimal sequential algorithm for the convex layers 
problem which runs in O(nlogn) time. In [5] Fujiwara et al. considered the 
convex layers problem under a very strong constraint, in which they proved that 
if all points of S lie on d horizontal lines, when d < (0 < e < |) the problem 

is still P-complete. They proposed an EP algorithm for the problem which runs 
in 0( "*°^" ) time using p processors if d < (0 < e < ^) and 1 < p < in the 

E REW-P RAM . That is, to achieve cost optimality, there must be d horizontal 
lines such that each line must pass through more than points in average, 
what is unlikely in most cases. Besides, the parameter d does not represent 
the substantial complexity of the problem. In this paper we present a new EP 
parallel algorithm for computing the convex layers of a set S' of n points. Let k 
be the number of the convex layers of S. When 1 < k < ni (0 < e < 1) our 
algorithm runs in 0( "^°^" ) time using p processors, where 1 < p < n^~ , in the 
E REW-P RAM , and it is cost optimal. 

Let S be a set of n (opaque) line segments in the plane. Hershberger [6] gave 
an 0(na(n) log^ n) algorithm for computing the envelope layers of S, where a(n) 
is the functional inverse of Ackermann’s function which grows extremely slowly. 
Here, we also give an algorithm for the envelope layers problem. Let k be the 
number of envelope layers of S. When 0 < k < (0<e<l) our algorithm 

runs in time using p processors, where 1 < p < n“ 2 “, in the 

E REW-P RAM . If we ignore a factor of log n our algorithm belongs to the class 
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EP. In the following sections, we give the outline of our algorithms. The details 
can be found in [2]. 

2 Finding Convex Layers of Points 

In our algorithm the following operations are required: (I) storing a convex layer, 
(II) searching for an edge of a convex layer, (III) finding the common tangents 
of two convex layers, (IV) deleting a set of contiguous edges from a convex layer, 
(V) given two convex polygons P and Q finding the convex hull of P and Q, 
and (VI) given x convex polygons. Pi, P 2 , . . . , Px with / vertices each, finding 
the common tangents of Pi and Pj for all pairs oi i,j {1 < i < j < x). Let the 
sizes of the convex layers and convex polygons are 0(n). We keep convex layers 
in balanced tree. Operation (I) can be done in O(logn) using n processors [7], 
operation (II) and (III) can be done in O(logn) time sequentially [9]. Operations 
(IV) and (V) can be done in O(logn) time sequentially [2]. Operation (VI) can 
be done in 0( iog° 2 ^p°L*°og 2 ^ 2 ) {p < fx^) time using p processors [3]. Let S' be a set 
of n points in the plane and k be the number of the convex layers of S. We sort 
S by its x-coordinates. The following algorithm finds the convex layers of S. 

Algorithm ComputeCL{S) 

[Input] A set S = {pi,P 2 , ■ ■ ■, Pn) of n points in the Euclidean plane sorted by 
their x-coordinates. 

[Output] A set CL{S) = {CLi{S),CL 2 {S), . . . ,CLk{S)) (1 < fc < n) of the 

convex layers of S, where CLi{S) {1 < i < k) is the tth convex layer of S. 

(Phase 1) Divide S into Si, S 2 , . . . , S 1 ^ subsets such that S^ (1 < i < n^) 
1 7^ 2 

contains n~^ points and the x-coordinate of any point of Si is less than the 
x-coordinate of any point of S^+i. 

(Phase 2) In parallel, for each subset Si(l<i<n 2 ) compute the convex lay- 
ers of Si, denoted as CL{Si) = {CLi{Si),CL 2 {Si ), . . . , CLk^Si)) by Chazelle’s 
sequential algorithm, where ki is the number of layers in Si. Store each CLj(Si) 
(1 < j < ki) (1 < i < n ^~ ) in balanced tree. 

(Phase 3) Let S' = Si (1 < t < n” ). Let a; = 1. Find CLx{S), that is, the 
xth convex layer of S, repeatedly as follows. 

(a) Find the convex hull of the outermost layers of S[, S' 2 , . . . , S' , which 

n 2 

obviously is CLx{S). 

(b) Revise S' (1 < i < n ~^ ) as follows: for each z (1 < i < n“ 2 ~ ) in parallel, 
delete the vertices of CLx{S) from S', and reconstruct the convex layers of S'. 

(c) Set x = X + 1. While all of S(, S 2 , . . . , S' ^ are not completely empty 

n 2 

return to (a). I 

In the above algorithm. Phase 1 can be executed in 0(1) time using n proces- 
sors, if S is stored in an array. In Phase 2, CL{Si) can be computed by Chazelle’s 
sequential algorithm [1] in 0{n^^ logn) time. Therefore, all the convex layers of 
Si, S 2 , . . . , S 1 ^ can be computed in 0(rz“^ log n) time using n“ 2 ~ processors. 

n 2 
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We can store CLj{Si) {I < j < h, 1 < i < n^) in balanced tree in 0(log |S'i|) 
time using |5'i| processors. Now let us consider Phase 3. Phase 3(a) can be done 
in O(logn) time using processors [3]. Using the operations we defined, 

Phase 3(b) can be done in 0(fc log n) time using 0{n ^~ ) processors. Since the in- 
structions of Phase 3 are repeated k times, where k is the number of convex layers 
of S, Phase 3 takes totally O(fc^logn) time using 0(n^) processors. Thus, the 
whole algorithm ComputeCL{S) can be executed in 0{max{n^ log n, log n)) 
time using n 2 processors. Thus by using p processors, where 1 < p < n 2 ^ it 

takes 0(wax( ”*°^" , " ^ ^ ^°^" )) time in the E REW-P RAM . Therefore, when 
I <k <n^ (0 < e < 1), the algorithm ComputeCL{S) is cost optimal and runs 
in 0( ”*°^" ) time using p processors, where 1 < p < 

3 Finding Envelope Layers of Line Segments 

Let S' be a set of n (opaque) line segments, and k be the number of the envelope 
layers of S. From the duality of lines and points, the convex layers of lines can 
be found by the algorithm for the convex layers of points. The main idea of our 
algorithm here is to reduce the problem of line segments into that of lines by 
using a segment tree. 

The plane can be partitioned into 2n -I- 1 slabs by drawing vertical lines 
through all the segments endpoints of S. The segment tree of S, denoted as 
ST{S), is built as follows [10]. Construct a complete binary tree with 2n leaves. 
Then let each leaf of ST{S) represent one slab taken in left-to-right order, and 
each internal node represent the union of its descendants’ slabs. Whether original 
slabs or a union of them, are called canonical slabs. Descending from the root 
to the leaves, associate with each node v £ ST{S) a set S'[u], where S']?;] consists 
of the segments or subsegments of S that have its endpoints on the boundary of 
the canonical slab represented by node v, which have not been associated with 
any of v’s ancestors in ST(S). For any node v, the line segments in S[?;] have 
the same left and right endpoints, respectively, therefore, they can be considered 
as lines. On the other hand, for any two nodes x and y on the same level of 
ST{S), the subsets associated with nodes x and y are separated. Thus, for the 
sets associated on the same level their envelopes can be found in parallel by the 
algorithm for the convex layers of points. 

Algorithm C omputeE L{S) 

[Input] A set S of n (opaque) line segments in the plane. 

[Output] A set EL{S) = (ELi{S), EL 2 {S), ..., ELk{S)) {1 < k < n) of the 
envelope layers of S, where k is the number of layers, and ELi(S) (1 < z < fc) is 
the zth envelope layer of S. 

(Phase 1) Use a segment tree, denoted as ST{S), which has m = 0(log n) levels, 
to cut the n segments of S into O(nlogn) subsegments and divide them into m 
separate groups, say Gi, G 2 , . . . , Gm, where group Gi (1 < z < m) corresponds 
to the subsegments associated on level z of ST{S). Reduce the envelope layers 
problem of Gi to the convex layers problem of \Gi\ points. 
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(Phase 2) For each f (1 < f < m) in parallel, find EL{Gi) the envelope layers 
of Gi by using our algorithm for the convex layers. 

(Phase 3) Find the envelope of EL{G\), . . . , EL{Gm) as follows. 

(1) Cutting EL{Gi), . . . , EL{Gm), by vertical lines into separate 
groups Ell, iJ 2 , ■ ■ ■ , E[ 1 ^ . Then, for each i (1 < i < n ^~ ) in parallel, compute 

n 2 

the envelope layers of Hi, EL{Hi) = {ELi{Hi),EL 2 {Hi), . . . , ELh^Hi)), where 
hi is the number of layers in Hi, by Hershberger’s sequential algorithm [6]. 

(2) Let k = max{h\,h 2 , ■ ■ ■ ,h i^). Obtain ELt{S) (1 < t < k), i.e. the 

n 2 

tth envelope layer of S, by concatenating ELt(Hi), ELt{H 2 ), ■ ■ ■ , ELt{H i-e 1 

n 2 

{ELt{Hi) is empty if t > hi). I 

Phase 1 can be done in O(logn) time using n processors. When 1 < fc < 

(0 < e < 1), Phase 2 can be done in " ) time using p processors, and 

Phase 3 can be done in time using p processors, where 1 < p < 

The proof can be found in [2]. 
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Abstract. DNA seqnencing by hybridization is potentially a powerfnl 
alternative to standard gel electrophoresis techniqnes. An important as- 
pect of the approach is the design of the probing scheme and of the 
associated seqnence reconstruction algorithm. Recently a novel probing 
scheme, whose performance is within a constant factor of the informa- 
tion theory bonnd, has settled the issue of asymptotic optimality. Thus, 
the research focus has shifted to the fine tuning of actual performance, 
with enormous potential for the life sciences. In this paper we discuss a 
new algorithmic device, called voting upon failure, which, exploiting the 
knowledge acquired in the course of the sequence reconstruction process, 
achieves typically a 20% performance improvement over the previous 
best technique, and comes at 90%-confidence within a factor 0.5 of the 
information-theory bound. 



1 Introduction 

A central application in molecular biology is the sequencing of DNA, i.e., the 
determination of the sequence of nucleotides of a chosen (fragment of a) DNA 
molecule. In recent years, a radically new technique has been proposed as an 
alternative to the traditional sequencing by gel electrophoresis. This technique, 
called Sequencing by Hybridization (SBH), proposed independently by several 
different research teams [BS91, D89, L88, P89, P91] is based on the use of a 
chip fabricated on a glass substrate. The active area of the chip is structured 
as a matrix, in each region of which (a feature) numerous copies of a specific 
oligonucleotide (a short sequence of DNA nucleotides) are implanted. The chip is 
immersed under controlled conditions within a solution of suitably labeled target 
DNA sequence. A copy of the target DNA will bind (hybridize) to an oligonu- 
cleotide if the oligonucleotide is complementary, in the Watson-Crick sense, to 
one of its subsequences. The labeling of the target allows visualization of the 
chip features containing binding oligonucleotides, thereby yielding a method for 
automatically probing the target sequence for specific subsequences. 

* This work was partially supported by the National Science Foundation under Grant 
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In summary, Sequencing by Hybridization consists of two fundamental steps. 
The first, biochemical in nature, is the acquisition, by complementary hybridiza- 
tion with a complete library of probes, of all subsequences (of a selected pattern) 
of a given unknown target sequence; the set of such subsequences is called the 
sequence spectrum. The second step, combinatorial in nature, is the algorith- 
mic reconstruction of the sequence from its spectrum. Most traditional schemes 
were restricted to probes in the form of strings of k symbols (/c-mers), which 
could achieve no better performance (expressed as the length of reliably recon- 
structible sequences) than the square root of the information-theoretic bound 
[DFS94, PFU99], as prominently noted in a review paper by well-known biolo- 
gist Ed Southern [S96]. 

Recently a novel approach has been proposed [PFU99],[PU00], whose per- 
formance is within a small factor of the optimum. The central feature of this 
approach is the adoption of a probing pattern where natural nucleotides are sep- 
arated by gaps of “wild-cards” , intended to hybridize non-specifically with any 
natural base and proposed to be realized with artificial universal bases [LB94], 
i.e., bases that stack without binding. 

It must be stressed that, since the question of the asymptotic performance has 
been settled, the focus of research in SBH sequence reconstruction has shifted to 
the refinement of the algorithmic procedures, in order to fine-tune computational 
methods which have enormous applicational potential in the life sciences. In this 
paper we shall briefly review the combinatorics underlying the approach, the 
known algorithmic technique, and illustrate a new property which enables a 
significant enhancement of the achievable performance. 

2 Review of the Gapped-Probe Approach 

A key feature of SBH is the adopted probing pattern, i.e., the structure of the 
spectrum probes. Notationally, a probing pattern is specified by a binary string 
(beginning and ending with a 1) where a 1 denotes a natural base and a 0 a 
universal base. For s -|- r = fc, an (s, r) direct probing scheme has the pattern 
ls(os-ii)r, (s,r) reverse scheme has the pattern (10'*“^)”1®. For no- 

tational convenience a probe is viewed as a string of length (r + l)s over the 
extended alphabet A = { A,C,G,T,*}, where * denotes the “wild-card”. 

In association with any such probing scheme the reconstruction of the se- 
quence is algorithmically carried out symbol-by-symbol from one end to the 
other [PFU99, PUOO]. The algorithmic primitive is the interrogation of the spec- 
trum with a query of the form u*, where u is a string of length (r -|- l)s — 1 over 
A, which returns all matching probes. Performance is naturally measured as the 
length of sequences that are reconstructible with a given confidence level, under 
the standard hypothesis that the target sequence is generated by a maximum- 
entropy memoryless source (i.i.d. symbols). 

The algorithm interrogates the spectrum with q*, where q is the current ((r-|- 
l)s— l)-symbol suffix of the reconstructed sequence. If only one probe is returned, 
the sequence is trivially extended; otherwise there is a potential ambiguity and 
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the algorithm attempts the extension of all potential paths up to a predetermined 
length H. The rationale is that the correct path is deterministically extended, 
whereas the extension of the spurious paths rests on the improbable presence of 
(fooling) probes in the spectrum. At the conclusion of this process, the sequence 
is extended with the common prefix of all surviving paths; otherwise, when such a 
prefix is empty, the existence of alternative paths of length H is taken as evidence 
that their extension would continue with no definite bound, and reconstruction 
fails [PUOO]. The performance of such an algorithm at 90%-confidence comes 
within a factor slightly smaller than 0.5 of the theoretical optimum. 

3 Analysis of the Failure Modes 
of the Reconstruction Algorithm 

The best known algorithm for sequence reconstruction [PUOO] has two failure 
modes, distinguished on the basis of the paths issuing beyond the ambiguous 
branching embodying the reconstruction failure. The failure modes can be clas- 
sified as follows: 

1. Failure Mode 1. There are two paths identical except for their initial symbol 
(corresponding to the branching). This failure has been caused by k fooling 
probes scattered, with possible overlaps, along the target sequence. On the 
basis of the spectrum alone there is no way to decide which of the multiple 
symbols detected at the branching corresponds to the correct extension. 

2. Failure Mode 2. There are no two paths as specified for Failure Mode 1. 
If H is large enough to make probabilistic extension of a spurious path to 
length H extremely unlikely, then there must be an ( (r -I- l)s — l) -segment of a 
spurious path, including or following the branching position which is identical 
to an actual segment occurring in the sequence. Obviously, in such a case 
extension of this path is deterministically assured and the reconstruction 
fails. (Such substrings are referred to as self-sustaining.) The self-sustaining 
segment agrees, entirely or partially, with an equally positioned segment of 
the correct path, the disagreements being concealed by fooling strings also 
occurring in the sequence. 

Failure Mode 1 is well understood and discussed elsewhere [PUOO]. Suffice it 
here to recall that, denoting with a = 1 — probability of occurrence 

of a specific probe in a target sequence of length m, an accurate estimate of the 
probability of failure by Mode 1 has the expression: 

P^ = l- 

Before analyzing Failure Mode 2, it is appropriate the study the two following 
examples (observed in simulations of random target sequences for the probing 
pattern (4,4): 1111 0001 0001 0001 0001). We have: 
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Example 1 



GCAT TACT GAAC GGTT ACA [T] TGAT 

[G] TCAC 



ATGA TGC. .. 
AAGC TAT... 



. . . AC gaac ggtt aca g 

* 


t c a c a a g 

* * * 


C TAT . . . 




Example 2 








ATAT GCAC GCCT [G] AGT GTAT 
[T] AGT GTAT 


ATTC TGGC 
ATCA TAGG 


CACT 

TAGG 


GCCA 

TGAC 


. . . TCGC GCGG T AAC g t a t 
* 


atca tagg 
* * * * 


t a c g 
* * 


t g a C 
* * * 



CG ... 
AG ... 

AG ... 



In the above examples the first two lines illustrate the branching event (symbols 
within brackets “[ ]”). Adequate prefixes of the two paths issuing from the 
branching are used to locate matching segments in the original sequence. The 
top path agrees with the putative sequence, whereas only a suffix of the bottom 
path agrees with an actual substring of the sequence. The latter is shown in the 
third line: the 19-symbol subsegment shown in lower case letters (referred to 
as the self-sustaining segment) supports the indefinite extension of the spurious 
path. This segment disagrees with the top (correct) segment in the positions 
labelled and these disagreements must be concealed by fooling probes. Note 
that the self-sustaining segment may not begin more than 19 = (r -|- l)s — 1 
positions to the left of the branching; in Example 1 it starts on the left of the 
branching, in Example 2 on its right. 



Next we analyze in detail this failure mode. The self-sustaining segment has 
length V = (r -|- l)s — 1. This length is indeed sufficient for indefinite path 
extension. If the branching position is conventionally denoted as 0, the first 
position J of the self-sustaining segment, referred to as the offset, must satisfy 
the condition J > —v. The failure events corresponding to each value J of 
the offset can be constructed by first selecting two positions in the sequence 
(in « m^/2 ways), so that its probability, for some coefficient ttj, can be 
expressed as 




The term ttj, referred to as the multiplier, depends upon the disagreements 
between the self-sustaining segment and the correct path. 

The analysis of ttj is greatly aided by an s-row arrays (shown in Figure 1 for 
s = 5 and r = 3), whose cells correspond to sequence positions with reference to 
the branching position and are numbered starting from —v = — (r -I- l)s -I- 1 in 
column-major order. In the array we identify as the critical set the set of cells 
0, s,2s, . . . , rs, rs -I- 1, rs -I- 2, . . . , (r -I- l)s — 1 (shown crossed-out in Figure 1), 




68 



Samuel A. Heath and Franco P. Preparata 





-15 


-10 


- 






m 


20 














-19 


-14 


-9 


-4 ■ 




B 


D 


21 








-18 


-13 


-8 


-3 l 2 


|H 




B 










-17 


-12 


-7 


1 " 


IH 














-16 


-11 


-6 


U A 


H 















(a) 





-15 


-10 




, i ; ii 


20 






-19 


-14 


-9 


-4 


1 6 11 1^ 








-18 


-13 


Q 


-3 










O 


1 — 




-17 


-12 


-7 


-2 


3 t 13 |_lg 








-16 


-11 


-6 




4 y 14^^ 









I 

(b) 



Fig. 1. Illustration of offset domains fot J = —7 and J = 7. Evidence is given 
to constrained symbols (light shading), critical set (crossed), and selectable rows 
(heavier shading) 

which represent the positions of the last symbols of the fooling probes necessary 
to support the branching. A value J of offset identifies in this array the domain 
(offset domain) of the self-sustaining segment (two typical offset domains are 
shown in Figure 1 for J = —7 and J = 7). For each value of the offset the 
following parameters are significant: 

1. The number of deterministically constrained sequence symbols at the self- 
sustaining segment site. Since no fooling probe can conceal disagreements 
that precede the branching, all the symbols preceding the branching are 
constrained (cells shown lightly shaded in Figure 1(a)). 
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2. The size of the intersection with the critical set. Each cell of this intersection 
corresponds to a fooling probe which is to match the self-sustaining segment. 
(Such sets, shown horizontally hatched, have size 3 in 1(a) and size 6 in 1(b).) 

3. The number of cells of the critical set with value smaller than J (such a set 
is empty for J < 0 and is shown vertically hatched in Figure 1(b)). Each 
such cell identifies a weak fooling probe. 



All cells in the self-sustaining segment domain with positive index and not 
belonging to the critical set are called selectable and may be sites of disagree- 
ments between the correct path and the self-sustaining segment (shown heavily 
shaded in Figure 1). The usefulness of the above diagram derives from the fact 
that, due to the pattern of fooling probes required by a disagreement (refer 
to the critical set as an example), rows of selectable cells are independent and 
contribute multiplicatively to ttj. 

It follows that the basic components of the analysis are the rows of se- 
lectable cells. Consider a row of u selectable cells (u-cell row). If there are no 
disagreements, then the row contributes (1/4)“ to the multiplier; otherwise, let 
h {h = 1, . . . ,m) be the leftmost cell of the row where a disagreement occurs. 
In such a case the contribution of the row is (l/4)^“^(3/4)o;““^+^, since ft. — 1 
symbols agree (each with probability 1/4), the ft-th symbol disagrees (with prob- 
ability 3/4) and must be confirmed by u — ft -I- 1 fooling probes, regardless of 
the identity of the symbols in cells to its right on the row. Summing over ft we 
obtain 



1 

4 « 



-k 






1 3 



/^h — 1 ^ 

h^l 



-a 






1 

4 « 



U—1 

(1 -I- 3a y^(4g)") 
i=o 



We let for simplicity of notation /3„ = l-|-3a contribution 

of a M-cell row becomes 

4« 

For convenience of analysis, values of J are aggregated on the basis of the 
size cr of the intersection of the placement domain with the critical set. These 
sets, denoted Group I,. . . , Group VI are illustrated for our running example in 
Figure 2, as differently shaded domains of cells. Cells in the same domain are 
covered by identical analysis. 

In general, these sets of offset are: 



1. Group I: J = —v, a = 0; 

2. Group II: J = —v + 1 + is + j, i = 0,1, ... ,r — 1, j = 0,1, s — 1; 

3. Group III: J = — s -I- 2, —s + 3, . . . ,0, a = r + l,r + 2, . . . , k — 1; 

4. Group IV: J = 1 -I- zs -I- J, i = 0, 1, . . . , r — 1, J = 0, 1, . . . , s — 1; 

5. Group V: J = rs + l,rs + 2, ... ,v, a = s — 1, s — 2, . . . ,1; 

6. Group VI: J > v,a = 0. 



In the full paper we shall report the detailed analysis of the contribution 
of each of these sets to the probability of failure. In this extended abstract, we 
illustrate in total detail the analysis of Groups I and II. It must be mentioned 
that, typically, the contribution of Group II accounts for about 4/5 of the total. 
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Fig. 2. Partition of values of J into Group I,. . . , Group VI 



1. Group I: J = —v, ct = 0; The self-sustaining segment fully agrees with the 
correct path, and the corresponding event has multiplier tt-i, = 
because symbols are fully constrained and one (the branching) can be 
chosen in 3 ways. 

2. Group II: J = —ly -I- 1 -I- is -I- j, i = 0, 1, . . . , r — I, j = 0, 1, . . . , s — 1, so 
that CT = i -I- I. We observe that the | J|-prefix of the segment fully agrees 
with the correct segment, thereby constraining |J| sequence symbols. (In 
fact, disagreements may only occur at selectable cells with index >0.) For 
each value of J, we form the vector of the lengths of s selectable-cell rows. 
This is illustrated in Table I, whose columns, indexed by the parameter j, 
are the length vectors. 



Table I. 





Group II 


j 


1 


2 


3 


s 


row 1 


0 


0 


0 


0 


row 2 


i 


z + 1 


i + 1 


i + 1 


row 3 


i 


% 


i-\-l 


i + 1 


row s 


i 


i 


i 


i + 1 



Placement J = —v + is + j constrains |J| sequence symbols and is accompa- 
nied by (id- 1) fooling probes of the critical set. Also, since the multiplicative 
contribution of a rt-cell row is /3u/4“, we conclude that the multiplier is 



4 4“'^ \4*+^ ) \4 / 



3a(4g)^ /A+iV \s-i 

4-^ V A ; 
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If we sum over j and i we obtain the global contribution of Group II: 



i=o j=i \ yi- / 




1 ) 



4 Sequence Extension Based on Voting 

In the presence of the any of the previously described failures, the standard algo- 
rithm is unable to provide a reliable reconstruction. Although for Failure Mode 
2 prior knowledge of the approximate sequence length may be used to discrimi- 
nate between the two paths being deterministically extended (by observing that 
the spurious extension will terminate with a sequence of length different from 
the nominal one), such a criterion is inapplicable to Failure Mode 1, since both 
extended paths are identical beyond the branching symbol. 

However, there is one important item of information that the advanced algo- 
rithm fails to use, i.e., the content, when failure occurs, of the prefix of the 
putative sequence currently constructed. In such a situation we have two alter- 
native sets of fooling probes, respectively corresponding to the two competing 
extensions. We may verify which of the two sets is better represented in 
and use this information to discriminate between the two alternatives. Following 
intuition, since failure has occurred, one set of fooling probes (the one causing 
the spurious extension) is guaranteed to exist, and its members are assumed to 
be scattered along the sequence. Therefore we adopt the criterion to reject as 
spurious the path whose fooling probes are better represented in the prefix 
Failure occurs also in the case of a tie, for in such a case discrimination is not 
possible. We now wish to analyze probabilistically the behavior of this policy, 
referred to as voting. 

Assuming that there are just two alternative extensions (the event of more 
simultaneous extensions is extremely rare), the discrimination is carried out on 
the basis of the competing sets of fooling probes. The analysis is straightforward 
for Failure Mode 1, since in such a case each probe set consists of the k probes 
sampling the branching position, because all other probes extend both paths. 
For Failure Mode 2, the analysis is more subtle. Referring to the preceding 
Example 2, each of the positions marked with an aterisk entails a train of at 
most (r -I- 1) fooling probes whose last symbol occurs not beyond the end of the 
self-sustaining segment. The union of such trains of probes is the fooling probe 
set for the chosen alternative. However, the self-sustaining segment cannot be 
determined by the algorithm; based on the fact that long transition segments 
are improbable, the algorithm constructs the discriminating sets on the basis of 
the disagreements occurring within (r-|- l)s — 1 positions from the branching. In 
the analysis, we shall use exactly (r -|- l)s — 1 probes, which is a valid criterion 
since each unrequired probe detected in adds 1 to both probe counts. 

Let 1 < t' < m be the position of failure and let be the common size of 
the competing probe sets, with z = 1,2 corresponding to the two modes (i.e., 
Hi = k and H 2 = (j + l)s — 1). Assume that there are j probes of the incorrect 
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extension in the range [1,^]. Since failure has occurred, the fXi probes of the 
incorrect extension set are scattered along the sequence, and the probability 
that a specific probe of this set occurs in a(iy) is 7 = Ijm. It follows that the 
probability of finding exactly j probes in 0 ( 17 ) is 

Let qf' {€) be the probability that there are at least j probes of the correct 
extension set in 0 ( 17 ). Since the probability of occurrence of a specific probe is 
now (5 = 1 — , we obtain 



Therefore the quantity is the probability of incorrect vote in 

correspondence to a failure of Mode i. We also let ’Ki{t) denote Prob(£) for Failure 
Mode i. We note that 7Ti{t) is uniformly distributed in the range [1 ,to], whereas 
7 T 2 (^) decreases linearly in the same range (in fact, in this case £ is the smaller 
of two values chosen in The probability that voting fails, conditional on 

the occurrence of the appropriate type of failure, is therefore 

m Hi 

P(vote failure|Mode i) = 

j-0 



m Hi / \ Hi / \ 

= E E (i) 

In conclusion we obtain the probability of failure <j)(m) as a function of m 

= P(vote failurejMode l)Pi + P(vote failurejMode 2 )P 2 (2) 

< Pi + P2 

and the equation = e quantifies, for a given e, the performance enhance- 

ment due to the voting approach. 

Unfortunately, Expression (1) is not easily amenable to a closed form. How- 
ever, numerical calculations of 4>{m) have been carried out and are summarized 
in Figure 3 for a specific illustration: A (4,4) probing scheme, i.e., k = 8. Al- 
though specific, this example is quite significant because it represents, in a sense, 
the accepted state of the art in terms of the microarray size. Figure 3 displays 
the probability of incorrect voting for Mode 1 failure as a function of £ (point of 
failure) and m (sequence length). 

As expected, this function increases with m for fixed £ and tends to 1 as £ 
decreases, since in such a condition the tie j = h = 0 becomes the most 
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Fig. 3. Probability of incorrect voting for Mode 1 failure as a function of I and 
TO {£ <m). 



likely event. The quantities P(vote failure] Mode i) are obtained by averaging the 
previous functions over the parameter £. Finally, the quantities Pi, P 2 , P{yote 
failurejMode 1) and P(vote failurejMode 2), each a function of to, are combined 
as in (2) to obtain the desired performance function The latter is pre- 

sented as 1 — (j){m) in Figure 4, to be contrasted with the probability of success 
of the previous algorithm [PUOO] without the voting provision. 

Shown in the same figure are sets of experimental data points for the two 
situations, which are in remarkable agreement with the corresponding analyses. 
At 90%-confidence the length of the correctly reconstructible sequence increases 
by nearly 20%, which quantifies the enhancement afforded by the approach dis- 
cussed in this paper, quite likely at the frontier of achievable performance. 
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Abstract. We consider a weighted generalization of multiple sequence 
alignment with sum-of-pair score. Multiple sequence alignment without 
weights is known to be AfP-complete and can be approximated within 
a constant factor, but it is unknown whether it has a polynomial time 
approximation scheme. Weighted multiple sequence alignment can be 
approximated within a factor of O(log^n) where n is the number of 
sequences. 

We prove that weighted multiple sequence alignment is MAX iSA/”P-hard 
and establish a numerical lower bound on its approximability, name- 
ly III — £. This lower bound is obtained already for the simple binary 
weighted case where the weights are restricted to 0 and 1. Furthermore, 
we show that weighted multiple sequence alignment and its restriction 
to binary weights can be approximated exactly to the same degree. 



1 Introduction 

Multiple sequence alignment (MSA) is an important problem in computational 
biology. The alignment of a group of protein or nucleotide sequences yields in- 
formation about the relationships between these sequences and it is also used 
to detect similarities (so called “homologous regions”) between them. This in- 
formation is applied in constructing evolutionary trees and finding coherences 
between the function and structure of proteins and their sequences. 

Many objective functions have been suggested to measure the quality of a 
multiple sequence alignment. One of the most widely used is the so called sum- 
of-pair score (SP-score, Carrillo et al. [6]). 

MSA with SP-score is known to be AfP-complete (Wang et al. [12]). For the 
case that the scoring function does not have to be a metric, Just has shown that 
MSA with SP-score is MAX iSAfP-hard [9]. Akutsu et al. have investigated the 
multiple sequence alignment problem under several scoring functions, namely 
#LOG#-score and /C-score [1]. They have shown that a variant of the multiple 
sequence alignment problem called local multiple alignment is MAX iSAfP-hard 
under these scoring schemes. 

* supported by DFG research grant RE 672/3 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 75-85, 2001. 
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However, if the scoring function fulfils the triangle inequality, no lower bound 
for this problem is known so far. The complexity of MSA over an alphabet of 
fixed size with metric SP-scoring functions is of main interest. According to Jiang 
et al. the approximability of MSA with metric SP-score is an important open 
problem in computational biology [8]. 

To represent existing knowledge about the relationships of the sequences con- 
sidered, a weighted variant of MSA was introduced by Wu et al. [13] . Each pair of 
sequences is assigned a nonnegative value reflecting their degree of relationship. 
This means that a pair which is assumed to be closely related will be assigned 
a high weight while a less related pair will be assigned a smaller weight. This 
generalization of MSA is called weighted MSA, or WMSA for short. 

In this paper we also examine a restricted version of WMSA called binary 
weighted MSA (BMSA), where the weights are restricted to 0 and 1. The binary 
weights can be used to represent an arbitrary graph over which multiple sequence 
alignments can be determined. We will prove that BMSA is equivalent to WMSA 
with respect to their approximability. Thus, an approximation algorithm for 
BMSA directly yields an approximation algorithm for the general case with 
the same performance ratio. Moreover, we prove the MAX iSAfP-hardness and 
a numerical lower bound for the approximability of BMSA. These results are 
obtained even if the sequences are of fixed length and the alphabet is of fixed 
size. Thus, the difficulty of multiple sequence alignment is caused by the number 
of sequences, not by their length. 

In the next section we give a formal definition of the problems considered. 
The reduction from WMSA to BMSA is presented in section 3. In section 4 
we prove a lower bound for the approximability of a problem called MAX-E2- 
neg-Lin2. This result will be used in section 5 to prove a lower bound for the 
approximability of BMSA. 



2 Definitions and Notations 

Let X be an alphabet and S' := XU{— }, where ” denotes a gap symbol. S[l] 
denotes the /-th symbol of a sequence S. Let S = {S'!, . . . , An} be a family (a 
multiset) of sequences over S. An alignment of 5 is a family A = {5i, . . . , S„} of 
sequences over S' such that all Si have equal length and Si is obtained from Si 
by inserting gaps. The following is an example of an alignment of three sequences 
ATTCTG, TTCTTTG and ATTGTT. 

ATTGT — G 
-TTGTTTG 
ATTGT-T- 

A function d : — >■ N will be called scoring function if it is a metric, 

i.e. for any x,y,z € S' we have d{x,y) = 0 iS x = y, d{x,y) = d{y,x), and 
d{x, z) < d{x, y) + d{y, z). We define the distance of two sequences Si and Sj of 
length I as D{S^,Sj) := J2[^id{Si[k], Sj[k]). 
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Carrillo and Lipman introduced a scoring scheme for alignments called sum- 
of-pair score (SP-score, [6]). The SP-score of an alignment A = {^i, . . . , Sn} is 
defined by D{A) := D{Si, Sj). Multiple sequence alignment (MSA) 

is the problem of finding an alignment with minimum SP-score. 

Wu et al. generalized MSA to weighted sum-of-pair score [13]. The weights 
are given by W := {wSi,Sj)Si,SjeSj a symmetric matrix of nonnegative integers. 
Then the weighted SP-score of an alignment A is Dw{A) := X)i<i<j<n ‘ 

D{Si,Sj). This generalization is called weighted multiple sequence alignment 
(WMSA). The aim is to find an alignment with minimum weighted SP-score. 

An instance of WMSA is a 4-tuple (If, 5, d, W). We consider the case of a 
fixed alphabet S and a fixed scoring function d. Thus, a problem instance of 
WMSA is given by a pair (5, W). It is easy to see that any lower bound for this 
case also holds if we allow arbitrary scoring functions and alphabets. 

A special case of WMSA is binary weighted MSA (BMSA), where the weights 
are restricted to 0 and 1. 

It has been shown that MSA with SP-score is AfP-complete [12]. For an 
arbitrary fixed constant r, MSA can be approximated in polynomial time within 
a factor of 2 — where n > r is the number of sequences [4]. It is unknown 
whether MSA admits a polynomial time approximation scheme (PTAS, see e.g. 
Ausiello et al. [3]). WMSA with arbitrary weights can be approximated within 
a factor of 0(log^ n) [13]. Using a technique of Bartal [5] one can obtain a 
randomized 0(logn • Hog n) approximation. 

Papadimitriou et al. introduced a class of optimization problems called 
MAX SNV [10]. They showed that there exist problems which are MAX SNV- 
complete with respect to L-reductions. In the following, opt(I) denotes the op- 
timal score of an instance I of an optimization problem. For example, opt(5) 
denotes the score of an optimal (weighted) alignment of S. 

Definition 1. Let II and II' he two optimization problems. Then II L-reduces 
to n' if there exist polynomial time computable functions f\, /2 and constants 
7 i ,72 > 0 such that for each instance I of II : 

1. Function fi produces an instance I' = f\{I) of II' such that opt(I') < 

7i •opt(I). 

2. Given a solution S' of I' with cost c'(S'), function /2 produces a solution S = 

f 2 {I,S') of I with cost c{S) such t/iot |c(S') — opt(/)| < 72 • |c'(S") — opt(/')|. 

No MAX SAfV-haxd problem has a PTAS, unless AfV = V (Arora et al. [2]). 

3 Reduction from WMSA to BMSA 

Let S = {S'!, . . . , Sn} be a family of sequences over S and W = (wSi,Sj) be a 
weight matrix. Let I be the maximal length of the sequences in S and dmax be 
the maximum of the scoring function d. We assume that the weights and the 
scoring function are unary coded. This does not seem to be a restriction because 
in practice the weights are very small and the scoring function is fixed. 
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We construct a family of sequences S' as an instance of BMSA as follows. Let 
K := 2-dynax‘l- For a sequence Sj £ S generate K copies Tj^ £ S' (1 < k < K) of 
this sequence. Furthermore, for each 1 < i < n construct wst,Sj copies 5*’^ £ S' 
(1 < M < wSi,Sj) of Sj. The weight matrix W' = (w'j is given by 

r 1 if / EE and J = si'" , 

w'l j •= ^ 1 if / = S'j" and J = Tj or vice versa , 

[ 0 otherwise , 

where A = B means that A and B are not only equal but denote the same 
sequence. 

Since the weights and the scoring function are unary coded, the input size 
N of the instance of WMSA fulfils the bound N G Q{n ■ I + wSi,Sj)- On 

the other hand, the input size N' of the constructed instance of BMSA satisfies 



N' £ 0{ n- K ■[ + 1 ■ wSi,Sj + {n- K + E”j=i • 



w 



Note that N' is polynomially bounded by N. 

Lemma 1. If S has an alignment A with weighted score Dw{Af) then S' has 
an alignment A' with score D\v>{A') = Dw{A). 

Proof. Let A = {Ai, . . . , be an alignment of S with weighted score D. We 
obtain an alignment A' = {A\A G 5'} of S' by setting T* = Sj and S'f" = Sj 
for all j, k, i, gi. The score of A' with respect to the weight matrix W' is 

i,j — l fi—1 k—l'^ ^ 1 ^ 

=0 =DiSi,Sj) 



□ 



Lemma 2. Given an alignment A' of S' with weighted score Dw'{A') we can 
construct an alignment A of S with less or equal score in polynomial time. 

Proof. Let A' = {A\A £ S'} be an arbitrary alignment of S' with score Z?w'(-4')- 
The copies of a sequence Sj £ S will be called consistent if there exists a sequence 
Bj with = Bj and S'f" = Bj for all k,i,P^- The sequence Bj is called block. 

We consider the case that for some jo the copies of Sj^ are not consistent 
and distinguish two cases. First, if not all are equal, let 
Dk-.= Y:=iYZ=i^ D{fl'Sl") 

be the score of with the sequences S'* A Choose fcp such that L>fco is minimal 
among all and set for all k ^ ko. This way we obtain a new 

alignment with less or equal score. 
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Now we consider the case that there exists a Bj^ such that = Bj^ for all 
k. Then there exists a sequence yf Bjg. This sequence yields at least score 

K with the sequences , because it yields a score of at least 1 with every . 
Set = Bjg. Then yields score 0 with any and at most score K 

with . Thus, the new alignment has less or equal score. 

By these modifications we iteratively obtain a new alignment of S' such 
that for any j G {1, . . . ,n} the copies of Sj are consistent with block Bj. The 
blocks of S' induce an alignment A = {Bi, . . . , Bn} of S with score D]^{A) = 
■ D{B^,Bj) < Dw'{A'). □ 

With these results we have shown that a A-approximation for BMSA can be 
used as a A-approximation for WMSA. Thus, the following theorem holds. 
Theorem 1. If BMSA can be approximated within a constant factor A in poly- 
nomial time, then WMSA can also he approximated within A in polynomial time. 

□ 



4 The Non-approximability of MAX-E2-neg-Lin2 

We consider the multiplicative group {1,-1}. Let G = {G\, . . . , Gt} be a multiset 
of linear equations over the variables U = {x \, . . . , Xr}, Gi=Xci j ■ j, = 

dii k > 2, G r}, and Oj G {1,-1} is a constant. MAX-Ek-Lin2 

is the optimization problem of finding the maximum number of simultaneously 
satisfiable equations. A restriction of MAX-Ek-Lin2 is MAX-Ek-neg-Lin2, where 
= — 1 for all 1 < i < t. 

MAX-E2-neg-Lin2 is exactly the problem MAX-Cut (see e.g. [3]) where 
the equations correspond to the edges, the variables correspond to the nodes, 
and multiple edges are allowed. Therefore, MAX-E2-neg-Lin2 is MAX SAfV- 
complete [10]. We use MAX-E2-neg-Lin2 here due to the simpler notation. 

An instance of MAX-Ek-Lin2 or MAX-Ek-neg-Lin2 consisting of t equations 
will be called rj-satisfiahle iff 77 • t is the maximum number of simultaneously 
satisfiable equations. Hastad proved in [7] that it is AfP-hard to distinguish 
(1 — e) -satisfiable and (| -b e) -satisfiable instances of MAX-E3-Lin2 for any 
e > 0. 

Instead of the known lower bound for the approximability of MAX-Cut 
(Hastad [7] and Trevisan et al. [11]) we will construct a reduction from MAX-E3- 
Lin2 to MAX-E2-neg-Lin2 to prove that it is AfP-hard to distinguish (|| — e)- 
and (II -b e)-satisfiable instances of MAX-E2-neg-Lin2 for any e > 0; the gadget 
used by Trevisan et al. [11] does not yield such a gap directly. This result will be 
used in section 5 to establish the lower bound for the approximability of BMSA. 

We will now reduce MAX-E3-Lin2 to MAX-E2-neg-Lin2. Let G={Gi, . . . ,Gt} 
be a multiset of equations over variables U, Gi=Xoi i ■ 2 • 3 = a^. 

We construct an instance G' of MAX-E2-neg-Lin2 with 22 • t equations and 
4 • t -b 2 • r -b 2 variables. The reduction is similar to the reduction from MAX- 
E3-Lin2 to MAX-E2-Lin2 in [7]. The set of variables U' is given by 

U' = {xj,xj\l <j<r}U {z^,z~} U {pi,l,p^, 2 ,Pi, 3 ,P^,z\f <i <t} . 
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Note that if an assignment satisfies an equation of an instance of MAX-E2-neg- 
Lin2, then the negated assignment also satisfies the equation. So without loss of 
generality we assume that in any case z~^ = 1. 

We interpret = xj. We call an assignment consistent for xj if x~^ yf xj 
and therefore x'j = Xj = {—x~). An assignment that is consistent for every xj 
and where z~^ yf is called consistent. 

For an equation Gi=Xon i 'Xai 2 we construct the twelve equations 

■ Pi,Q' = “1 9, q' = 1,2,3 and g yf g' , 

<?= 1,2,3 , 
forg=l,2,3. 

We add either the four equations • pi^q = —1 {q = 1, 2, 3) and • pi^z = —1 
if Oi = 1 or the four equations • pi^q = —1 {q = 1, 2, 3) and 2 ;+ • pi^z = —1 if 
Oi = —1. For every equation in Q we construct the three equations xf^. ^ ■ x~. ^ = 
— 1 {q = 1,2,3). Finally, we add the equation z~^ ■ z~ = —1 three times. Note 
that Q' contains 3 • t times the equation z~^ ■ z~ = —1. Let nj be the number 
of occurrences of the variable Xj in Q. Then Q' contains Uj times the equation 

4 • ^7 = - 1 - 

For every equation Gi G Q we have constructed 22 equations for Q' . These 
22 equations are called the representation of Gi. 

Lemma 3. Let an arbitrary assignment for U he given. Assign z~ = —1 and 
Xj~ = Xj, x~ = {—Xj) for j = 1, . . . , r. Then for any i G {1, . . . , t} there exists 
an assignment for pi^i, pi^ 2 , Pi, 3 , and pi^z such that 18 equations of the repre- 
sentation of Gi are satisfied if Gi is satisfied by the given assignment and 16 
equations of the representation are satisfied if Gi is not satisfied. 

It is not possible to satisfy more than 18 equations of the representation if 
Gi is satisfied by the assignment and to satisfy more than 16 equations if Gi is 
not satisfied by the assignment. 

Proof. The lemma can be proved by testing all possible assignments. □ 

If an assignment for U satisfies g of the t equations of Q, then the corre- 
sponding consistent assignment for U' satisfies l6-t-\-2-g equations of G' . This 
assignment can be found efficiently by adjusting the assignment for pi^i, pi^ 2 , 
Pi^ 3 , and Pi^z- On the other hand, a consistent assignment for U' that satisfies 
\Q-t-\-2- g equations of G' yields an assignment for U that satisfies g equations 
ofG. 

Lemma 4. Given an arbitrary assignment for U' that satisfies 16 • t 2 ■ g 
equations of G' , a consistent assignment that satisfies at least this amount of 
equations of G' can be computed in polynomial time. 

Proof. First assume that z~^ = z~ in the given assignment. Then the 3 • t equa- 
tions = —1 are not satisfied by the assignment. Let = {—z~^). Then 

these 3 • t equations will be satisfied. On the other hand, occurs in only 3 • t 
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other equations. Thus, at most 3 • t equations are no longer satisfied. Altogether 
the number of satisfied equations is not decreased by this modification. 

If there exists a j with x'j' = x~ , then there are Uj equations x~^ ■ x~ = 
— 1 that are not satisfied by the assignment. Let xj = (— x^). Then the nj 
equations x'j' ■ xJ = —1 are satisfied by the modified assignment. On the other 
hand xJ occurs in only nj other equations. Thus, at most rij equations are no 
longer satisfied. The number of satisfied equations is thus not decreased by this 
modification. 

This way we iteratively obtain a consistent assignment. Obviously, the mod- 
ifications can be computed in polynomial time. □ 

Now we can prove the following theorem used in section 5. 

Theorem 2. For any e > 0 it is NV-hard to distinguish ( ^ — e) - and ( ^ + e) - 
satisfiable instances of MAX-E2-neg-Lin2 . 

Proof. An instance of MAX-E3-Lin2 is ry-satisfiable iff the corresponding in- 
stance of MAX-E2-neg-Lin2 is -satisfiable. According to Hastad [7] it 

is AfP-hard to distinguish (l — ^)- and (| + C)-satisfi£^ble instances of MAX- 

E3-Lin2 for any ^ > 0. Thus, it is AfV-haxd to distinguish ^ )~ 

( ) -satisfiable instances of MAX-E2-neg-Lin2. Choosing ^ = 11-e com- 
pletes the proof. □ 

Since MAX-Cut and MAX-E2-neg-Lin2 are exactly the same problem, we 
obtain the same approximability gap for MAX-Cut. 

Corollary 1. For any e > 0 it is MV-hard to decide whether the maximum 
cut of an instance G = (V,E) (where multiple edges are allowed) of MAX-Cut 
consists of at most -I- e) • \E\ or at least — e) • \E\ edges. □ 

5 The Non-approximability of BMSA 

In this section we reduce MAX-E2-neg-Lin2 to BMSA. Let Q = {Gi , . ..,Gt} 
be an instance of MAX-E2-neg-Lin2 over a set of variables U = {x\, . . . ,Xr}, 
Gi=Xciii ■ 2 = We construct a family of sequences 

S = {Z} U {Xj\j = 1, . . . , r} U ) f} over the alphabet E = 

{•, o,x}. Let Z := oooooooo be a sequence of length 8. Z will be used as a 
control sequence. For j G {!,..., r} let Xj := * 0000000 * be a sequence of 
length 9 that represents the variable xj G U. For each i G {1, . . . , t} create two 
sequences Tip := * 00 x 0 x 00 * and Tip := * 000 x 000 *, each of length 9. Yi^q 
represents the variable ^ in Gi. 

The scoring function is given in the following table. Note that it is a metric. 





- 


• 


0 


X 


- 


■q 


T 


y 


X 


• 


1 


0 


1 


4 


0 


2 


1 


0 


3 


X 


5 


4 


3 


0 
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The weight matrix W = (wi^j)ij^s is given by 

r 1 if / = Yi^q and J = , 

_ I 1 if / = Z and J = Yi^q or vice versa , 

’ I 1 if / = Yi^q and J = Xa^^g or vice versa , 

[ 0 otherwise . 

The set Si = {Yi^i,Yi^2, X^^ ^ will be called the representation ofGi. Note 
that in general a sequence Xj occurs in more than one representation. 

Let A = {^|S' G 5 } be an alignment of S. Then Di{A) denotes the score of the 
equation Gi, Di{A) = D{Yi^i, Yi^2) + D(Yi^i, Xai,i) + D(Yi^2, + ^) + 

D{Yi^2,Z). By the construction of the weight matrix, Dw(A) = J 2 l=i 
holds. 



Definition 2 . An alignment A = G 5 } of S will be called variable- 

consistent with respect to an assignment for U if, after eliminating all columns 
consisting solely of gaps (which do not affect the score), the following holds for 
all j, i, and q: 



1 . 



3 . 



Z = -Z- 






Y — 

J t^q — 






-Y, 



t,q 



ifxj = -1 
ifxj = 1 

= -1 
= 1 



The following lemma follows immediately from this definition. 



Lemma 5 . An alignment is variable- consistent iff for all i = l,...,t and q = 
1 , 2 the following properties hold: 



A. Either Ti ^[ 1 ] or ^[ 9 ] matches a gap in Z. No other character of Z or Yi q 
matches a gap in the other sequence. 

B. No character in either of the two sequences Yi^q, X^^ ^ matches a gap in the 

other sequence. □ 



These properties are referred to as property A and B. The following is an 
example of a variable-consistent alignment representing the equation Gi=x\ ■ 
X2 = — 1 which is satisfied by Xi = — 1 and X2 = 1 . 

Yi I = •00X0X00* — 

1^2= — *000X000* 

Xi = *0000000* — 

A2 = — *0000000* 

Z = — 00000000 — 



Note the functional region of a pair Yi^i,Yi^2 given by the triples xox and 0x0. 
If and Yi^2 represent the same value, the functional region yields a weighted 
score of 9 . Otherwise, it yields a weighted score of 3 . If an alignment A is variable- 
consistent, we have Di{A) = 29 if Gi is satisfied by the represented assignment 
and Di{A) = 31 otherwise. 
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The next two lemmas have similar proofs. Thus, we only give a proof of the 
first. 

Lemma 6 . Alignments of the pairs {Yi^i,Z} and {Yi^ 2 ,Z} yield scores of 8 and 
5, respectively, if they fulfil property A. Violating property A yields scores of at 
least 10 and 7, respectively. 

Proof. An alignment of {Yi^i, Z} that fulfils property A yields score 8 . 

Let us consider an alignment of {Yi^i,Z} that does not fulfil property A. 
Then at least one of the characters [2] , . . . , Yi^i [ 8 ] , Z[l] , . . . , Z[8] matches a 
gap in the other sequence. 

We distinguish two cases. If there is an “x” in Yi^i matching a gap in Z, then 
the alignment yields a score of 5 for this “x” plus 3 for the other “x” plus 1 for 
each So altogether it yields a score of at least 10. 

On the other hand consider the case that no “x” in Y^ i matches a gap in Z. 
Then there is a “o” in Yi^i or Z matching a gap in the other sequence. So the 
alignment yields a score of 3 for each “x” plus 1 for each plus 2 for the “o” 
matching a gap. So the alignment again yields a score of at least 10. 

The statement about Yi ^2 and Z can be proved in a similar fashion. □ 

Lemma 7. Alignments of the pairs {Yil,Xa^ j} and 2 , 3 } yield scores of 

6 and 3, respectively, if they fulfil property B. Violating property B yields scores 
of at least 8 and 5, respectively. □ 

With the fact that an optimal alignment of a pair {Fj 1 , F^ 2 } has score 7 we 
can prove the following. 

Lemma 8 . Given an arbitrary alignment with score 31-t — 2-g we can construct 
a variable- consistent alignment with less or equal score in polynomial time. 

Proof. Let A be an arbitrary alignment with Dw{A) = ■ t — 2 ■ g. 

Let / be the set of all i such that Yi^i and F ^_2 fulfil properties A and B. 
This implies an assignment for the variables Ui = {xj G U\3i G I : Xj G Si}. 
Let I = {1, . . . ,t} \ I. Because in every set Si tor i G I there exists a sequence 
Yi^q that violates property A or B, we have DfiA) > 31 for each i G I due to 
Lemmas 6 and 7. 

For i G I it G Ui {q G {1,2}), we realign Yi^q with respect to 
Then we assign an arbitrary value to the variables in U \ Ui and realign the 
corresponding Yi q and Xj. 

By these modifications we obtain an alignment A' . Then DfiA') = DfiA) 
tor i G I and DfiA') < 31 < DfiA) otherwise. Thus, Dw{A') < Dw{A). A' is 
variable-consistent due to its construction and can be computed in polynomial 
time. □ 

The alignment obtained yields an assignment that satisfies at least g equa- 
tions of Q. 



Theorem 3. BMSA is MAX SMV-hard. 




84 



Bodo Siebert 



Proof. We reduce MAX-E2-neg-Lin2 to BMSA. /i is given by the construction 
of S from a family Q of t equations. One can see that opt(5) < 31 • t. 

An equation of G will be satisfied by 2 of the 4 possible assignments of its 
variables. Therefore, for every multiset G oft equations an assignment exists that 
satisfies at least | • t equations. Then for 71 = 62 we have opt(5) < 71 • opt(t/). 

Given an alignment of S with score 31 ■ t — 2 ■ g' for some g' we can find an 
assignment satisfying g > g' equations of G due to Lemma 8. Let 72 = 5, then 
\g — opt(t/)| < 72 • 1(31 ■ t — 2 ■ g') — opt(5)| holds. □ 



Theorem 4. BMSA has no polynomial time approximation with performance 
ratio III — e for any e > 0, unless AfV = V. 

Proof. An instance of MAX-E2-neg-Lin2 consisting of t equations is 77-satisfiable 
iff the corresponding instance of BMSA has an alignment with score (31 — 2-77) -t. 

The optimal alignment of a BMSA instance corresponding to a (|| — f)- 
satisfiable instance of MAX-E2-neg-Lin2 has score (31 — 2-(||— ^))-t = 3?3+2^, 
t. Using the (||| — e) -approximation algorithm for BMSA we are able to find 
an alignment with score at most (||| — e) • 3^3+22.; ^ ^ 

The optimal alignment of a BMSA instance corresponding to a (|| -I- ^)- 
satisfiable instance of MAX-E2-neg-Lin2 has score (31 — 2 - (^ + 'C)) ■ t =: K 2 . 

We have Ki < K 2 iS f ^ ' 647^323-e • Choose f with 0 < ^ < ^ ' 647^323-e • Then 
the (m — e) -approximation for BMSA can be used to distinguish (|| — ■?)- and 
(M T 'C)“Satisfiable instances of MAX-E2-neg-Lin2. This would imply J\fP = P 
due to Theorem 2. □ 

Since WMSA is a generalization of BMSA it is also MAX SMV-haid and we 
obtain the same non-approximability result. 



6 Conclusions 

We have shown MAX iSAfP-hardness and proved a numerical lower bound for 
the approximability of weighted multiple sequence alignment (WMSA). These 
results hold even if we restrict the problem to binary weights (BMSA) . Further- 
more, BMSA and WMSA are equivalent with respect to their approximability. 
But the distance to the best known upper bound is huge. An obvious goal is to 
reduce this gap. 

Finally, we would like to know how well the unweighted version of the multiple 
sequence alignment problem with metric SP-score can be approximated. 



Acknowledgements 

I wish to thank Martin Bohme, Andreas Jakoby, and Rudiger Reischuk for valu- 
able discussions and comments on this paper. 




Non-approximability of Weighted Multiple Sequence Alignment 



85 



References 

1. T. Akutsu, H. Arimura, and S. Shimozono. On approximation algorithms for local 
multiple alignment. In Proc. Ann. ACM Int. Conf. on Comput. Mol. Biol, 
pages 1-7, 2000. 

2. S. Arora, C. Lund, R. Motwani, M. Sudan, and M. Szegedy. Proof verification and 
the hardness of approximation problems. J. ACM, 45(3):501-555, 1998. 

3. G. Ausiello, P. Crescenzi, G. Gambosi, V. Kann, A. Marchetti-Spaccamela, and 
M. Protasi. Complexity and Approximation. Springer- Verlag, 1999. 

4. V. Bafna, E. L. Lawler, and P. A. Pevzner. Approximation algorithms for multiple 
sequence alignment. Theor. Comp. Sc., 182(l-2):233-244, 1997. 

5. Yair Bartal. On approximating arbitrary metrics by tree metrics. In Proc. 30th 
Ann. Symp. Theor. Comput., pages 161-168. AGM, 1998. 

6. H. Garrillo and D. Lipman. The multiple sequence alignment problem in biology. 
SIAM J. Appl. Math., 48:1073-1082, 1988. 

7. J. Hastad. Some optimal inapproximability results. In Proc. 29th Ann. Symp. 
Theor. Comput., pages 1-10. ACM, 1997. 

8. T. Jiang, P. Kearney, and M. Li. Some open problems in computational molecular 
biology. SIGACT News, 30(3):43-49, 1999. 

9. W. Just. Computational complexity of multiple sequence alignment with SP-score. 
Technical report, Ohio University, 1999. 

10. C. H. Papadimitriou and M. Yannakakis. Optimization, approximation, and com- 
plexity classes. J. Comp. Sys. Sc., 43(3):425-440, 1991. 

11. L. Trevisan, G. B. Sorkin, M. Sudan, and D. P. Williamson. Gadgets, approxima- 
tion, and linear programming. SIAM J. Comput., 29(6):2074-2097, 2000. 

12. L. Wang and T. Jiang. On the complexity of multiple sequence alignment. J. 
Comp. Biol, l(4):337-348, 1994. 

13. B. Y. Wu, G. Lancia, V. Bafna, K. Chao, R. Ravi, and C. Y. Tang. A polynomial- 
time approximation scheme for minimum routing cost spanning trees. SIAM J. 
Comput., 29(3):761-778, 1999. 




A Greedy Algorithm for Optimal Recombination 



Shiquan Wu and Xun Gu* 

Center of Bioinformatics and Biological Statistics 
Iowa State University, Ames, lA 50011, USA 
{sqwu,xgu}@iastate . edu 



Abstract. Let E be an alphabet and E'^ denote the collection of all 
sequences of length n over E. For any si = aifl2 • • • ajOj+i • • • a„, S2 = 
bib 2 ■ ■ ■ bj 6j+i ■■ - bn £ H", a recombination of si and S2 at position j is 
defined as an operation that crosses si and S2 at position j and generates 
ti = aifl2 • • • ajbj+i ■ ■ - bn and t2 = 6162 • • • bjaj+i • • • a„. Denote A and S 
two collections of sequences. In this paper, we discuss generating A from 
iS by a series of recombinations in minimum number of steps. We present 
a greedy algorithm for finding the optimal recombination evolutionary 
history from S to any tree A of sequences when |iS| = 2. 



1 Introduction 

Various types of mutations on sequences play an important role in computational 
biology. Transformations using insertion, deletion, substitution, and reversal are 
widely studied by applying statistics and algorithms (cf. [1,2]). Recently, much 
attention has been paid to recombination of sequences. Hein developed some 
algorithms for recombination problems(cf. [3,4]). Kececioglu and Gusfield dis- 
cussed a recombination distance problem on generating a third sequence from a 
given pair of sequences in optimal recombination cost (cf. [5]). 

Generally, an edit distance problem of genomes is widely studied and its 
aim is to find the optimal evolutionary history from some ancestors to some 
descendents by using certain types of mutations such as insertion, deletion, sub- 
stitution (point mutation), reversal, etc. Parsimony trees and phylogenetic trees 
are some intersting problems in the category. In [6], an alignment with recombi- 
nation is discussed and it is an edit distance problem involved recombinations. 

In this paper, we discuss a similar distance problem involved recombinations. 
The purpose is to generate a collection A of sequences from another collection 
S of sequences by a series of recombinations in minimum number of steps. 



I . 1 Problem and Example 

Definition 1 . Let E be an alphabet (e.g., E = {a,c,g,t}, etc) and V" the 
collection of all sequences of length n over E. For any Si = 0102 • • • ajttj+i ■ ■ ■ a„, 
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S2 = bib2 ■ ■ ■ bjbj+i ■ ■ ■ bn G A"”, a recombination for Si and S2 at position j 
(1 < j < n — 1) is defined as the following operation: 

From Si =0102 • • -aj aj+i ■ ■ ■ an 
S 2 =&i&2 ■ ■ ■ bj bj + i ■■ - bn 
to ti =ai02 • • -Oj bj+i ■ ■ - bn 
t2 =blb2 ■■■ bj Qj + i ■■■an - 

It is denoted i?(si, S2; j; ti, t2), or i?(si, S2; j). Both Si and &2 are called par- 
ents and both ti and t2 are called children. 



Definition 2 . Let 5 C 17 ". S is called a recombination ancestor of A if 
A can be generated from S by a series of recombinations, i.e., there exist a 
series of recombinations (i = 1 , 2 , •••, p) such that 



(i) Ai) 



(IM 

p}. 



G 5 U {ti‘ 
(i) „(*). i(i). 



(fc) (fc). 



1 < A: < f - 1}, (2)A C SU 1 < i < 

^2*^) (i = is called a recombination process 

generating A from S. It is called optimal if the number p of recombination steps 
is minimized among all such possible recombination processes. 



Recombination Problem 1 . Let A C 17" and S a recombination ancestor of 
A. Find the optimal recombination process generating A from S. Denote n{S, A) 
the number of optimal recombination steps generating A from S. 

Example 1. Let A = {ai = 01010, a2 = 10001, as = 10110, ai = 11001} and 
S = {si = 00101,82 = 11010}. We have n{S,A) = 4 and the following is an 
optimal recombination process generating A from S. 

S2; 1; ai, bi): From 0 0101 =6i=si 
1 1010 =62=82 

To 0 1010 =&3=ai 

I 0101 =64 

i?2(b4,S2; 2; bs, bg): From 10 101 =64 

II 010 =62 
To 10 010 =65 

11 101 =be 

i?s(b4, S2; 3; as, a4): From 101 01 =64 

110 10 =62 

To 101 10 =&7=as 
110 01 =bs=&4 

i?4(si, bs; 3; a2, bg): From 001 01 =61 

100 10 =65 

To 001 10 =69 

100 01 =6io=a2 



If S consists of only two sequences, then Problem 1 can be regarded as one with 
|i7| = 2. For binary case, S = {00 • • • 0, 11 • • • 1} always generates an arbitrary 
binary sequence by recombinations. So it is a recombination ancestor for any 
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"4 C {0, 1}". Moreover, given any recombination ancestor S = {snSi 2 ■ ■ ■ Sin\i = 
1,2} of A = {anai 2 ■ ■ ■ ain\i = 1,2, •••,/c}, for each 1 < j < n, we define a 
function fj on {sij,S 2 j}(sij ^ S 2 j) such that fj(sij) = 0 and fj{s 2 j) = 1- Then 
we get Sq = {/l(sil)/ 2 (si 2 )---/n(si«),/l(s 2 l)/ 2 (s 22 )---/n(s 2 n)| = {00 • • • 0 , 
11 •••1} and At) = |/i(oii)/ 2 (ai 2 ) • • • /„(ai„)|i = l,2,---,fc|. And n{S,A) = 
n(5o,Mo). For |A| = 2, Recombination Problem 1 is reduced to the following 
simple form. 

Recombination Problem 2. Let A C (0, 1}" be an arbitrary collection of 
binary sequences and S — {00 • • • 0, 11 • • • 1}. Find the optimal recombination 
process generating A from S. 

In this paper, we discuss Problem 2 and find algorithms for it. 

1.2 Terminology and Notation 

Definition 3 . Let a = 0102 ■ ■ ■ ap ■ ■ ■ Uq ■ ■ ■ an € {0, 1}". I = a[p, q] = OpOp+i • • • 
aq is called an alternative segment of a if at yf a^+i for all p < i < q — 1. An 
alternative block is a maximal alternative segment. Denote /a = {I1A2, ' ' ' Ak\ 
the collection of all alternative blocks of a. Define the core of a as Ca = 
which is the minimum segment of a containing all 7^(1 < i < k). Define the 
length ofa[p,q] as l{a[p,q\) = q — p and the length of as l{Ia) = X)y=i^(4j)- 
Denote V{A) = {p\\ < p < n — 1, Op ^ Op+i for some a G A}. For example, 
if a= 11 10101 1 1010 000100, then I) = a[3,7] = 10101 , I2 = 1010 , h = OlO- 
7a = {7i,72,73|. Ca = a[3, 17] = 10101 1 1010 00010. l{h) = 4 and 1 ( 1 ^) = 9. 

Definition 4. Let a, b G {0, 1}” with Ca = a[p, q] and Ch = b[u, w]. If u < p < 
q < V and a[p,q] = h[p,q], we say that b covers a, denoted by a Qh.h and 
a are called disjoint or independent if q < u, or v < p, denoted by a C\h — 0. 
If q ^ u, we say a < b. For any alternative segments ( or blocks) I and J, we 
similarly define I Q J, I C\ J = %, and I < J. 

Definition 5. Denote C{A) = {Caja G A}. A is called a nest of sequences if 
a C b, or b C a for any a, b G A. It is called a tree of sequences if either 
aflb = 0, or a C b, or b C a for any a, b G A. For a, b G A, z/b C a and there 
exists no c G A such that b C c C a with c yf a and c yf b, then b is called 
a branch of a. Denote i?a Ihe collection of all branches of a in A. A branch is 
called a leaf (or youngest branch) if it has no branch in A. a G A is called a root 
(or oldest branch) if it is not a branch of any other sequences in A. 



1.3 Related Work 

Our recombination problem is a generalization of the following problem(cf. [5]). 

Recombination Distance. Given a = ai02 • • • a^, b = 61 &2 • • and c = 
C1C2 • • • Cfc, find the minimum cost recombination to produce c from a and b. 




A Greedy Algorithm for Optimal Recombination 



The goal is to generate a third sequence from a given pair by recombinations 
consisting of multiple crossovers and point mutations. In our problems, we de- 
fine a recombination consisting of a single crossover with no point mutations. We 
discuss a more general problem to construct the optimal recombination span- 
ning history from one family of sequences to another one. Its general case with 
multiple crossovers and point mutations is NP-complete(cf.[5,6]). 

2 Theorems and Algorithms 

In this section, we show some theorems on optimal recombination processes and 
design a greedy algorithm for finding the optimal recombination process for a 
tree of binary sequences. We always assume S = {00---0, 11---1}. 

Theorem 1. Let a G {0, 1}”. Then n(5, {a}) = l{Ia)- 

Proof, a is optimally generated by i?(si, S2; p; bi, Ci), i?(bi, Ci; p -|- l;b2,C2), 

(b2 , C2 , p -f 2, b3 , C3) , , 7?(bg_p_ I , Cq—p—ij Q 1, bg_p, Cg_p) , • * * over all 

positions in P({a}). Therefore, n{S, {a}) = 1(1^). 

The series of recombinations in the proof is called a recombination extension 
and denoted by ext{si,a). Generally, if a C b and a has been generated, then b 
can be generated by series of recombinations on the positions in P({b}) but not 
P({a}), called an recombination extension from a to b and denoted by ext{a, b). 
Theorem 2. If A is a nest, then n(5,A) = \V{A)\. 

Proof. By Theorem 1 and recombination extensions. 

Theorem 3. Let Ai,A 2 C {0, 1}” be independent nests, i.e., aflb = 0 for any 
a G A\,h G A 2 . Then n{S,A\ U A 2 ) = \V{Ai)\ + \'P{A 2 )\. 

Proof. Apply Theorem 2 to Ai and A 2 , respectively. 

Theorem 4. Let Ai,A 2 C {0,1}” be independent nests and a G {0,1}”. For 
any bi G A\ and b2 G A2, < i >2 and {^}bi, b2 C a. Then n{S,Ai U A2 U 

{a}) = |P({a})| + l. 

Proof. Choose a position p between Ai and A 2 , then Ca is partitioned into Ci 
and C 2 - Choose ai and a2 with Ca^ = Ci and Caj = C 2 . Then the theorem 
follows by applying Theorem 2 to both Ai + {ai} and A 2 + {a2} with one more 
recombination i?(ai,a2;p), i.e., |P({ai})| -|- |P({a2})| -I- 1 = |P({a})| -|- 1 steps. 

Theorem 5. Let A be a tree. Then n(5, A) = \V{A) \ + ~ ^)- 

Proof. By Theorem 4 and induction on |A|. Choose a root a G A and denote 
Ba = {di < d2 < • • • < dp}. We define ea;t(di, d2, • • • , dp; a) as the following 
recombination process: (l)Choose a position pi between and d^+i (1 < f < 
p— 1), (2)Partition Ca by all pi into Ci and choose b^ with Cb^ = Ci(l < i < p), 
(3)Make ext{di,hi) to get b^ from di(l < i < p), (4)Make the recombinations: 
i?(bi,b2;pi;fi,gi) and b^; p*; fi, g*) (2 < i < p - 1) with b^ C (1 < 

j < i — l). Then fp_i = a. ext(di, d2, • • • , dp; a) generates a from di, d2, • • • , dp 
in optimal steps. A is then generated from branches to roots by induction. 
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Based on Theorem 5, we now design a greedy algorithm for finding the op- 
timal recombination spanning evolutionary history generating a tree A from S. 

Greedy Algorithm 

Input: A (nest/tree). 

Output: Optimal recombination process. 

Step 1 Find i?a for all a G A and set E = %. 

Step 2 While A — E { 

Choose a leaf (youngest branch) aG A — E and 
denote = {di < d 2 < • • • < dp}. 

Generate a by ea;t(di, d 2 , • • • , dp; a). 

E = E+{a}. 

} 

Theorem 6. Greedy Algorithm finds the optimal recombination process for a 
tree A and the run time is 0{\A\'^n). 

Proof. For each pair a and b of sequences, the algorithm compares n positions to 
determine whether a C b or not. There are 0(|Ap) pairs of sequences. Therefore 
the run time is 0(\A\‘^n}. 

3 Conclusion 

The greedy algorithm is designed to generate a tree from two sequences. The 
most interesting part of the problem is to generate an arbitrary collection of 
sequences from a given recombination ancestor. Our discussion may be applied to 
some problems in human SNP (single nucleotide polymorphism) genome project. 
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Abstract. A d-dimensional simplicial mesh is a Delaunay triangulation 
if the circumsphere of each of its simplices does not contain any vertices 
inside. A mesh is well-shaped if the maximum aspect ratio of all its 
simplices is bounded from above by a constant. It is a long-term open 
problem to generate well-shaped d-dimensional Delaunay meshes for a 
given polyhedral domain. In this paper, we present a refinement-based 
method that generates well-shaped d-dimensional Delaunay meshes for 
any PLC domain with no small input angles. Furthermore, we show that 
the generated well-shaped mesh has 0{n) d-simplices, where n is the 
smallest number of d-simplices of any almost-good meshes for the same 
domain. A mesh is almost-good if each of its simplices has a bounded 
circumradius to the shortest edge length ratio. 

Keywords: Mesh generation, Delaunay triangulation, well-shaped, as- 
pect ratio, radius-edge ratio, computational geometry, algorithms. 



1 Introduction 

Mesh generation is the process of breaking a geometry domain into a collection 
of primitive elements. In this paper we exclusively consider d-dimensional sim- 
plicial Delaunay meshes. The aspect-ratio of a mesh is the maximum aspect-ratio 
among all of its simplicial elements. A mesh is well-shaped if its aspect ratio is 
bounded from above by a small constant. The aspect ratio of a simplex is usually 
defined as the ratio of its circumradius to its inradius. Generating well-shaped 
d-dimensional Delaunay meshes is one of the long-term open problems in mesh 
generation when d > 2. An alternative but weaker quality measurement is to 
use the radius-edge ratio [8]. It is the circumradius divided by the shortest edge 
length of the simplex. The radius-edge ratio of a mesh is the maximum radius- 
edge ratio among all of its elements. A mesh is almost-good if it has a small 
radius-edge ratio. Numerous methods [3,8,9,10] guarantee to generate almost- 
good 3-dimensional Delaunay meshes. 

Bern et al. [1] showed that any set of n d-dimensional points had a Steiner 
Delaunay triangulation with simplices, none of which has an obtuse 

dihedral angle. No bound depending only on n is possible if we require a bound 
on the minimum dihedral angle [1]. We assume a d-dimensional piecewise lin- 
ear complex (PLC) domain 12 as the input. Shewchuk [11] showed that if each 
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/c-dimensional constraining facet in i? with k < d—2 is a union of strongly Delau- 
nay fc-simplices, then Q has a d-dimensional constrained Delaunay triangulation. 
The Delaunay refinement method [3,9,10] can be extended to d-dimensions to 
generate almost-good Delaunay meshes if 17 satisfying the projection lemma [10] 
and having small input angles. However, it encounters significant difficulties in 
producing well-shaped Delaunay meshes in 3D. Chew [4] first proposed to add 
a vertex inside the circumsphere of any given badly-shaped tetrahedron (sliver) 
to remove it. The difficulty is in proving the existence of, and finding, a Steiner 
vertex that does not itself result in slivers. To make it possible. Chew defined 
a tetrahedron to be a sliver if it has a large aspect ratio, a small radius-edge 
ratio and its circumradius is no more than one unit. Without the restriction on 
the circumradius length, almost all results in [4] donot hold any more. In addi- 
tion, with this restriction, the termination of his algorithm is straightforward. 
Recently, Li [6] extended this algorithm and showed how to generate well-shaped 
non-uniform Delaunay meshes in 3D. 

In this paper, we show how to generate d-dimensional well-shaped meshes for 
a PLC domain with no small angles. In Section 2, we review the basic concepts 
and define what is a d-dimensional sliver simplex. ^ Then we give an algorithm 
in Section 3 to generate d-dimensional well-shaped Delaunay meshes. It basically 
adds a point around the circumcenter of each of d-simplices containing any k- 
dimensional sliver (hereafter, fc-sliver). We prove its correctness in Section 4 and 
its termination guarantee in Section 5 by showing that the distance between the 
closest mesh vertices is just decreased by a constant factor compared with that 
of the input mesh. In Section 6, we show that the size of the generated mesh is 
within a constant factor of the size of any almost-good mesh generated for the 
same domain. Section 7 concludes the paper with discussions. 



2 Preliminaries 

After inserting a new vertex p, every new d-dimensional simplex created in the 
Delaunay triangulation of the new vertex set has p as one of its vertices. The 
new triangulation can be updated by efficient operations local to the vertex p. 
A sphere centered at a point c with a radius r is denoted as (c, r) hereafter. It is 
called empty if it does not contain any mesh vertices inside. The nearest neigh- 
bor graph defined by a d-dimensional vertex set is contained in the Delaunay 
triangulation of the vertex set. Thus the shortest edge length of the Delaunay 
triangulation is the closest distance among mesh vertices. This fact is used in 
proving the termination guarantee of our algorithm. 

Delaunay refinement methods have been shown to be effective in generating 
almost-good meshes in 2 and 3 dimensions [9,10]. There is also no much diffi- 
culty to extend them to d-dimensions, if the input domain satisfies a projection 
lemma [10], to generate meshes with radius-edge ratio no more than \/2 

^ Surprisingly, a simple extension of 3-dimensional sliver definition to d-dimensions 
does not work here. 
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For a fc-simplex /i, its min-circumsphere is the smallest d-dimensional sphere 
containing the vertices of /i on its surface. A point encroaches the domain bound- 
ary if it is contained inside the min-circumsphere of a boundary fc-simplex /i. 
Here a fc-simplex /i is a boundary simplex if it belongs to the Delaunay tri- 
angulation of a fc-dimensional input boundary polyhedral face. A fci-simplex ti 
directly encroaches another fc 2 -simplex T 2 if the circumcenter of t\ encroaches T 2 - 
A fci-simplex t\ indirectly encroaches another /c 2 -simplex T 2 if the circumcenter 
of Ti encroaches a simplex /i and p, directly or indirectly encroaches T 2 . Assume 
the circumcenter of a /ci-simplex r encroaches a boundary fc 2 -simplex /x. Call r 
the encroaching simplex and p, the encroached simplex. Assume that p contains 
the projection of the circumcenter of r inside. Then Shewchuk [10] showed that 
the circumradii of t and p satisfy that > ^Rt- 

A d-simplex is bad if it has a large aspect ratio. Let’s consider a fc-simplex 
T, where 1 < fc < d. For later convenience, we use i?r, Lr and p(r) = Rr/Lr to 
denote the circumradius, the shortest edge length and the radius-edge ratio of 
T. Let V be its volume. We define a = a{r) = VjL^ as a measure of its quality. 
Let go and ak, 1 < k < d he positive constants that we specify later. 

Definition 1. A k-simplex r is well-shaped if p{r) < go and a(r) > Ok- 



Definition 2. [Sliver] Call a k-simplex t a fc-sliver if p{t) < go, cr(r) < ak, 
and each of its facets is well- shaped if k > 3. 



We call a d-simplex r a sliver- simplex if it contains a fc-sliver. If a d-simplex 
T has a small radius-edge ratio and a small a value, then it must have a face x 
that is a sliver or r itself is a d-sliver. It is not difficulty to prove that the volume 

_orr<i 



of a d-simplex r is at most (fidRr, where ipd = ‘^ Yli=2 
aspect ratio of r is then at most 



The 



This verifies our definition of cr. 



3 Refinement Algorithm 

For a fc-simplex r, we call the intersection of its min-circumsphere with the affine 
space defined by its vertices as its fc-sphere. When refine a fc-simplex r, we add 
a point p inside the shrinked fc-sphere {ct,SRt), where d < 1 is a constant. 
The point p is good if its insertion will not introduce any small slivers in the 
new Delaunay triangulation. Here a created sliver p is small if < bRr for 
a constant b specified later. We call the solid fc-dimensional ball {cr,6Rr) the 
picking region of r, denoted by V{t). Its volume is (jikd^Rr, where <fk = rETVT- 

Algorithm: Refinement (go, ao, S, b) 

Enforce Empty Encroachment: Add the circumcenter Cr of any encroached 
boundary simplex r and update the Delaunay triangulation. If Ct encroaches 
any lower dimensional boundary simplex p, add instead of Ct- 
Clean Large Radius-Edge Ratio: Add the circumcenter Cr of any d-simplex 
T with a large p(r) and update the Delaunay triangulation. If Cr encroaches 
any boundary fc-simplex, we apply the last rule instead of adding Cr- 
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t 





Fig. 1. Left: a 4-dimensional sliver example; Middle: the picking region of a 
simplex; Right: the forbidden region of /r with circumradius Y . Here R — 2qqY 
and D — 2kaoY. 



Clean the Slivers: For a sliver-simplex r, add a good point p G V{t) and 
update the Delaunay triangulation. If the circumcenter Ct encroaches the 
domain boundary, we apply the following rules instead of finding p. If the 
insertion of p introduces some new d-simplices with large radius-edge ratio, 
we apply the previous rule to eliminate them immediately. 

Encroach Boundary Simplices: If a boundary fc-simplex fi is encroached di- 
rectly or indirectly by a d-simplex with a large radius-edge ratio, add the 
circumcenter and update the Delaunay triangulation. However, if en- 
croaches any other lower dimensional boundary simplex pi, we insert the 
circumcenter of instead of adding c^. 

If a boundary fc-simplex p is encroached directly or indirectly by a sliver- 
simplex, add a good point p G V{p) and update the Delaunay triangulation. 
However, if encroaches any other lower dimensional boundary simplex p\^ 
we add a good point from P{pi) instead of adding p. 

The key part of the algorithm is to find a good point p to refine a sliver- 
simplex T. We select some k random points from V{t). Then choose the point 
that optimizes the quality tr of all created small simplices with a radius-edge 
ratio less than go- Another approach is based on a randomized selection [4,6,7]. 
We randomly select a point p G V{t) until we find a good point p. By defining 
sliver and small slivers properly, we can show that we are expected to find a 
good point p in constant rounds. Then the rest of the paper is devoted to prove 
the termination guarantee and the good grading guarantee of the algorithm. 

4 Proofs of Correctness 

4.1 Sliver Regions 

Recall that a fc-simplex r is a sliver if pir) < go, cr(r) < ak, and all of its facets 
are well-shaped. The quality measure (t(t) is related with a distance-radius ratio 
defined in [2,5]. Consider any vertex p of r. Let p be the facet formed by other 
vertices of r. Let D be the Euclidean distance of point p from the hyperplane 
passing through p. Recall is the circumradius of p. 
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Lemma 1. For any k-simplex t, -^ < 2k'^^. 

Proof. It follows from the volume of r: P = a{r)L’^ = ED 

A ^-simplex r is a sliver if p(r) < Qq and cr(r) = < (Tq for a, small constant 

(To, i.e., (Tfc = (Jq. Consequently, we have < 2kao if r is a fc-sliver. 

Given a well-shaped /c-simplex r, a point p that forms a (/c-|-l)-sliver together 
with T can not be anywhere. It must be around the circumsphere of t and is not 
far-away from the hyperplane passing r. We call the locus of such point p as the 
sliver region or forbidden region Fr of the simplex r. It is easy to show that Fr 
is contained in the solid sun-hour glass shaped region as illustrated in the right 
figure of Figure 1. Let \Ft\ denote the volume of FV- 

Lemma 2. For any well-shaped k-simplex t, |FV|<CdCrg“^“'"^i?^, where Cd is a 
constant depending only on go, k, and d. 

Proof. We know that FV is inside the slab region defined by the two hyper- 
planes iFi and iF 2 illustrated by the right figure of Figure 1. Let V 2 , r\ be the cir- 
cumradius of the two spheres intersected by iFi and the circumsphere of t. Here 
F 2 = ||cp|| and ri = ||c(j'||. It is easy to show that < {1 — {2kao)‘^ -\- SkaoQo) ■ 
and > (I — (2F(7o)^ — 8fc(Jo^?o) • The volume of a fc-sphere with the radius 
r is 4>kr^. Then 

Notice Vz > 1, — rf^'^ < (r| -I- z’^)(r 2 — r\). In addition, r| -|- rj = 

2(1— (2fc(To)^)F^ < 2Y‘^, and < IdkgoaoY"^ . The fact that V 2 >Y implies 

that z -2 — ri < ia>kgo<JoY ■ By induction, we have r^—r^ < 2^^/'^^'^'^kgocyoY^ . 
Consequently, we have |Ft| < 4>k4>d-k‘2‘''^^'^^^^kgo(Jo{‘^kaoY~^Y'^ . The lemma 
follows by setting Cd = Vo > 4)k(t>d-k‘^''^^'^^^^kgo{2kY~'^ . □ 

A good point p used to refine a fc-simplex r cannot be inside F^ fl V{t) for 
any well-shaped m-simplex /i, We use F^^k to denote such intersection region. 

Lemma 3. For any k-simplex t and a well-shaped m-simplex h, |F)j^fe| < CfcCToi?^, 
where Ck is a constant that depends only on go, d and k. 

4.2 Existence 

We then show that given a /c-simplex r in an almost-good (i-dimensional sim- 
plicial Delaunay mesh, there is a good point p in V{t). Let S{t) be the set of 
simplices in any dimensions that each can form a small Delaunay sliver with a 
point from V{t). A created sliver p, is Delaunay if it belongs to the Delaunay 
triangulation by inserting a point from P(r); it is small if < bRr- 

We first recall some results from Talmor et al. and extend it to cf-dimensions. 
For a mesh vertex v, the edge length variation, denoted by v{v), is defined as the 
length of the longest edge incident on it divided by the length of the shortest 
edge incident on it. Talmor [12] proved that, given an almost-good mesh in 3D, 
the edge length variation of each mesh vertex is at most a constant depending 
on the radius-edge ratio of the mesh. This result is extended to d-dimensions. 
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Lemma 4. Given any vertex v of a d-dimensional almost good mesh, v{v) < vq, 
where vq is a constant depending on the mesh’s radius-edge ratio qq. 

Lemma 5. Given a k-simplex r of an almost-good mesh, the number of the 
vertices of S{t) is at most a constant depending on wq and b. 

Proof. Assume that a well-shaped m-simplex p. forms a Delaunay sliver to- 
gether with a point p G P{t). The edges of p have lengths at most < 2bRr- 
The edges incident on any vertex q of p before point p is introduced have length 
at least 2bRr/nQ. It implies that the closest distance among all vertices from 
S{t) is at least 2bRr/vo- It is simple to show that all such vertices are inside the 
sphere centered at c,- with a radius {6-\-2b)Rr- Then by a volume argument, the 
number of vertices of S{t) is a constant. ED 

Thus the number of simplices in S{t) is also a constant, let’s say, W. 

Theorem 1. Good point exists in any k-simplex r of an almost-good mesh. 

Proof. Each simplex p from S{t) claims a forbidden region with volume at 
most Cfe • (ToR^. The circumradius of p is at most bRr- The volume of V{t) is 
(fkd^R^. Therefore, if we select (Tq such that W ■ cuCfQh^R^ < (t>k5^R^, then the 
pigeonhole principal will guarantee the existence of a good point p € V{t). EQ 



5 Termination Guarantee 

A fc-sliver is called created if it contains at least one Steiner vertex; otherwise 
it is called original. We classify the bad d-simplices into three categories. The 
simplices containing an original k-sliver are called original sliver- simplices. The 
simplices containing only created k-slivers are called created sliver-simplices. The 
third category has d-simplices with a large radius-edge ratio. Let e(r) be the 
shortest edge length introduced by eliminating a d-simplex t. Here e(r) could 
be less than Lr for a sliver-simplex r. For simplicity, we assume that all original 
sliver-simplices are removed first. 

Lemma 6. For an original sliver- simplex p, e{p) > Lt ■ 

Lemma 7. For a simplex r with ^ > qq, e(r) > ^d-i Lr. 

The proofs are similar to the 3-dimensional counterpart, which are omitted. 
It remains to show that refining any created sliver-simplex will not introduce 
shorter edges to the mesh. Let’s consider a sliver-simplex p. Assume it contains 
a fc-sliver t created by inserting a point from the picking region of an element, 
say /(r). Element /(r) is called the parent of the fc-sliver r. There are three 
cases: /(r) is a sliver-simplex; /(r) is a d-simplex with a large radius-edge ratio; 
/(r) is an encroached boundary simplex. 
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Lemma 8. Assume a sliver-simplex fj, contains a k-sliver r created by splitting 
another sliver- simplex /(r). Then e{p,) > ‘ 

Lemma 9. Assume a sliver-simplex /i contains a k-sliver t, and parent /(t) 
has a large radius-edge ratio. Then e{fT) > • Lf(^T-y 

Lemma 10. Assume a sliver-simplex fi contains a sliver r created by splitting 
a boundary m-simplex x that is encroached directly or indirectly by p{x)- Then 

1. e{p.) > ^)eoLp(x) ^ If d-gijjipigx p[x) has a large radius-edge ratio. 

2. e{p) > , if d- simplex p{x) is a sliver- simplex. 

v2 

Combining all above analysis, we know that the shortest distance between 
all mesh vertices is at least factor of that of the original mesh if we choose 

b, (5 and qq such that (1 — S)b > and (1 — 6)go > 2'^. We then have the 

following theorem by a volume argument. 

Theorem 2. The algorithm terminates in generating well-shaped Delaunay 
meshes. 



6 Good Grading Guarantee 

As [6,7,10], we study the relation between the nearest neighbor function defined 
by the final mesh and the local feature size function defined by the input do- 
main. The local feature size lfs{x) of a point x is the radius of the smallest ball 
intersecting two non-incident segments or vertices. The nearest neighbor N(x) 
of a point x is its distance to the second nearest mesh vertex. 

With each vertex v, we associate an insertion edge length e„ equal to the 
length of the shortest edge connected to v immediately after v is introduced 
into the Delaunay mesh. Notice that v may not have to be inserted into the 
mesh actually. If v is an input vertex, then is the distance between v and its 
nearest input neighbor. So > lfs{v) from the definition of lfs{v). If v is from 
the picking region of a sliver-simplex t, then (1 -|- 6)Rr > e„ > (1 — 6)Rr. The 
inequality (l-|-(5)i?i- > e„ comes from the fact that there is a mesh vertex on the 
circumsphere of t. 

For the sake of convenience of analyzing, we also define a parent vertex p(y) 
for each vertex v, unless v is an input vertex. Intuitively, for any noninput vertex 
v, p{v) is the vertex “responsible” for the insertion of v. We discuss in detail what 
means by responsible here. Assume v is inserted inside the picking region of a 
simplex /i. If simplex p, has p{p) > go, then p{v) is the most recently inserted 
end point of the shortest edge of p. If /i is a sliver-simplex containing an original 
fc-sliver r, then p{v) is an end point of the shortest edge of t. If /x is a created 
sliver-simplex containing a created fc-sliver r, then p(v) is the most recently 
inserted vertex of r. If /x is an encroached boundary fc-simplex, then p(v) is the 
encroaching vertex. For the sake of simplicity, we always use the almost-good 
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mesh generated by Delaunay refinement method as input mesh. Therefore the 
boundary simplices can not be encroached by input vertices. Ruppert [9] and 
Shewchuk [10] showed that N{) defined on the mesh generated by Delaunay 
refinement method is within a constant factor of lfs{v), i.e., N(v) ^ lfs{v). 

Lemma 11. Let v be a vertex in the final mesh and let p = p{v). Then we have 
Cv > lfs{v) for an input vertex v; and Cy > C ■ Cp for a Steiner point v, where C 
is a constant specified in the proof. 

Proof. If v is an original input vertex, then > lfs(y) from the definition 
of lfs{v). Thus the theorem holds. Then consider a non-input vertex v. We first 
consider that v is selected from the picking region of a d-simplex; say p. 

Case 1.1: pis a, d-simplex with large radius-edge ratio p{p) > Qq. The parent 
p is one of the end points of the shortest edge of p. Let Lp be the length of 
the shortest edge pq of p. Then q is an original vertex or is inserted before p. 
Therefore, Cp < \\p - g|| = Lp < ^. Thus Cy = Rp > go ■ Cp. 

Case 1.2: ^ is a sliver-simplex containing an original /c-sliver r. The parent 
p = p{v) is one of the end points of the shortest edge of t. Let Lr be the length 
of the shortest edge pq of r. Similar to the previous case, we have Cp < \\p — g|| = 
Ly. Notice that Rp > Rt > Lt/ 2. Thus > (1 — S)Rp > • Cp. 

Case 1.3: pis a sliver-simplex containing a created fc-sliver r. There are three 
cases about the parent element /(t) of r: /(r) is a sliver; /(r) has a large radius- 
edge ratio; /(t) is an encroached fc-dimensional boundary simplex. Recall that 
the parent vertex p = p(v) is the most recently inserted vertex of r. 

Subcase 1.3.1: /(r) is a sliver. Recall that the insertion of p from the picking 
region of sliver /(t) will always avoid creating small slivers. Thus, Ry > bRji^^p 
where i?/(r) is the circumradius of /(r). Notice that Cp < {1 + d)i?/(r)- Thus, 
e„ > (1 - S)bRft^r) > ]=^b ■ Cp. 

Subcase 1.3.2: /(r) has p(/(r)) > go- Let pq be an edge of simplex r, where 
p is inserted from the picking region of /(t). Notice that Cp < \\p — q\\. We also 
have Rt > \\p — <z||/2. Thus, e„ > (1 — 6)Rp > (1 — 6)Rt > ■ Cp. 

Final subcase 1.3.3: /(r) is an encroached boundary ^-simplex. We first con- 
sider the scenario that /(r) is encroached by a sliver-simplex directly or indi- 
rectly. We then know that the insertion of p from /(r) will always avoid creating 
small slivers because the mesh is almost-good. Thus Ry > bRf(^y). Notice that 
Cp < (1 + S)Rf(^yy Thus, e„ > (1 — S)bRf(^y-^ > ^^b ■ Cp. We then consider the 
scenario that /(t) is encroached by a d-simplex with a large radius-edge ratio 
directly or indirectly. Here parent p is selected from the picking region of /(r). 
Let pq be an edge of r. Notice that Cp < \\p — gjj. We also have Ry > \\p — q\\/2. 
Thus, e„ > (1 - 5)Rp > (1 - 5)Ry >^-ep. 

Then we consider that v is selected from the picking region of a boundary 
/c-simplex y, which is encroached by a m-simplex p. 

Case 2.1: p is encroached by a sliver-simplex directly or indirectly. Here 
parent p is always the circumcenter of p. Notice that Rp < -\/2Rx — 

(1 + 5)Rp. Thus, e„ > (1 - 5)R^ > • Cp. 
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Case 2.2: ^ is encroached directly or indirectly by a d-simplex with a large 
radius-edge ratio. Parent p is the circumcenter of p. We always have < V^Rx 
and 6p = Rp. Thus, Cy = Rx> RpV^ > ^ ■ Cp. ED 



For a vertex v, as [10], we define Dy = We call Dy the density ratio 

at point V. Clearly, initially Dy is at most one for an input vertex v, and after 
inserting new vertices, Dy tends to become larger. 

Lemma 12. Let v he a vertex with a parent p = p{v) if there is any. Assume that 
&v > C-Bp. Ifv is inserted due to eliminating sliver-simplex, then Dy < + 

If V is inserted beause of eliminating a d-simplex with a large radius-edge ratio, 
then Dy <l-\- 

The proof is omitted, which is almost the same as the 3-dimensional counterpart. 



Theorem 3. There are fixed constants Dk > I, I < k < d such at for any 
vertex v inserted or rejected at the picking region of a bad k-simplex, Dy < Dk- 
Specifically, the values of Di should satisfy the following conditions: 



Dd > max{ 



QqA -\- \/2 Bd-2 
Po(l — d) — 2^0^“^ ’ 



a6 -I- -I- aBd-2 

b - 



Dd-k = Bk-i -\- V2 a^Dd, 



a -\- 



2 

1-d 






where l<k< d—1, A = o;(l — (5)-|- and Bi = Hence, 

there is a constant D = max^^j^jllfc} such that Dy < D for all mesh vertex v. 

The proof, which is based on induction, of the theorem is omitted due to space 
limit. The following theorem concludes that the generated mesh has good grad- 
ing, i.e., for any mesh vertex v, N{v) is at least some constant factor of lfs{v). 



Theorem 4. For any mesh vertex v generated by refinement algorithm, the dis- 
tance connected to its nearest neighbor vertex u is at least 

The proof is omitted here, which is the same as the three-dimensional counter- 
part. Thus, if Qo > 2‘^a'^ and b > \/2 a'^, our algorithm generates well-shaped 

Delaunay meshes with good grading. Ruppert showed that the nearest neighbor 
value N{v) of a mesh vertex v of any almost-good mesh is at most a constant 
factor of Ifs (v ) , where the constant depends on the radius-edge ratio of the mesh. 
The above Lemma 4 shows that the nearest neighbor N{v) for the well-shaped 
Delaunay mesh is at least some constant factor of lfs{v). Notice that the num- 
ber of vertices of an almost-good mesh is j^jj^dx). Then we have the 

following theorem. 



Theorem 5. Given a d-dimensional almost-good mesh with n vertices, the gen- 
erated well-shaped mesh has 0{n) vertices, where the constant depends on d and 
the radius-edge ratios of the meshes. 
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7 Discussions 



In this paper, we present a refinement-based method that guarantees to remove 
all slivers in a d-dimensional almost-good simplicial mesh. Notice that the (Tq 
derived from all the proofs may be too small for any practical use. We would 
like to conduct some experiments to see what ctq can guarantee that there is no 
small slivers created. In addition, we could have different definitions about slivers 
depending upon the location of the simplex: inside or near the domain boundary. 
This could also improve the bound on cto- Let pd be the minimum radius-edge 
ratio of a d-simplex. It is easy to show that P2 = ^ and pd = — 1^2 ' which 



is much less than the radius-edge ratio bound qq achieved by our algorithm 
(and also the Delaunay refinement). We would like to know if we can get better 
radius-edge ratio bound on the d-simplices of the generated mesh. 
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Abstract. We state the following conjecture: any two planar n-point 
sets (that agree on the number of convex hull points) can be triangulated 
in a compatible manner, i.e., such that the resulting two planar graphs 
are isomorphic. The conjecture is proved true for point sets with at most 
three interior points. We further exhibit a class of point sets which can 
be triangulated compatibly with any other set (that satisfies the obvious 
size and hull restrictions). Finally, we prove that adding a small number 
of Steiner points (the number of interior points minus two) always allows 
for compatible triangulations. 



1 Introduction 

Can any two planar point sets (that agree on the number of points and ex- 
treme points) be triangulated in the same manner? This intuitive question is 
the topic of the present paper. Apart from the theoretical interest in this basic 
problem, questions of this kind arise in various areas of application, including 
image analysis, morphing, and cartography. 

To define the problem more carefully, let Ai and S2 be two finite sets of points 
in the Euclidean plane. Two triangulations T\ of S\ and of S2 are called com- 
patible if the face lattices formed by their triangles, edges, and vertices (points) 
are isomorphic. Compatible triangulations obviously exhibit isomorphic graphs 
of edges. The converse is not true, in general, as an edge isomorphism between 
T\ and T2 need not preserve the external face. This led us to prefer the notion 
of compatible triangulations to isomorphic triangulations which sometimes has 
been used in the literature. 



* Research supported by APART [Austrian Programme for Advanced Research and 
Technology] of the Austrian Acedemy of Sciences, and the EU Improving Human 
Potential Programme HPRI-1999-CT-00071 
** Research supported by the FWF [Austrian Fonds zur Forderung der Wis- 
senschaftlichen Forschung] 

* * * Research supported by Proyectos DGES-MEC PB98-0933 and Gen. Gat. 
1999SGR00356 



J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 101-110, 2001. 
© Springer- Verlag Berlin Heidelberg 2001 




102 



Oswin Aichholzer et al. 



The problem of triangulating two given point sets compatibly comes in two 
flavors, namely where the bijection between the points of Si and S 2 is either 
fixed or variable. The case of fixed correspondence is a known problem and has 
been studied in Saalfeld [7]. He pointed out that compatible triangulations do 
not always exist and proposed several heuristic approaches for their construction. 
The challenging problem left open is to determine the complexity of the related 
decision problem, which so far is neither known to be NP-complete nor to be 
polynomially solvable. 

The problem becomes easier if and S 2 represent the cyclic order of the 
vertices of two simple n-gons. Given a bijection between the polygon vertices 
(there are only n cyclic shifts to choose from) the existence can be decided in time 
0{vS), and compatible triangulability of the polygons can always be achieved 
by adding O(n^) extra points (so-called Steiner points) in either polygon; see 
Aronov et al. [2]. Improvements which take into account the number of reflex 
vertices [5] or the quality of the angles of the produced triangles [3] exist. 

The present work is concerned with the problem of finding compatible tri- 
angulations without fixed point correspondence. To our knowledge this problem 
has not been studied before. The main question is, of course, under which propo- 
sitions compatible triangulations do exist. It is clear that in both sets S'! and 
S 2 the number n of points must be the same, as has to be the number h of 
points on their convex hulls CH{Si) and CH{S 2 ), by the well-known formula 
e = 3n — ft. — 3 on the number of edges of a triangulation. Moreover, convex hull 
points necessarily map to convex hull points, and their cyclic order is preserved. 
This follows from the easily proven fact below: if two triangulations are com- 
patible then any two corresponding triangles ijk and i' j'k' must have the same 
orientation, either clockwise or counter-clockwise. (We ignore the symmetric case 
where the orientation is reversed for each matching pair of triangles.) Intuitively 
speaking, this means that the underlying point sets S\ and S 2 ’locally’ exhibit 
the same order type, as defined in Goodman and Pollack [4]. It is interesting to 
note that the last two properties need not hold if only graph isomorphism rather 
than compatibility of the triangulations is required. 

We will assume throughout that all point sets considered are in general po- 
sition, meaning that no three points within a set lie on a common straight line. 
There are simple examples of point sets not in general position that do not admit 
compatible triangulations even if the propositions mentioned above are all met; 
see e.g. Krasser [6]. 

Throughout, let |C'i7(5')| denote the number of extreme points in a point set 
S. Our main conjecture can be formulated in the following way. 

Conjecture 1 (Main Conjecture on Compatible Triangulations) 

Let Si and S 2 he two sets of points without fixed correspondence. Then compatible 
triangulations for Si and S 2 exist if these sets fulfill the following properties: 

- I^iI = |52|, 

- \CH{Si)\ = \CH{S 2 )\, and 

- Si and S 2 are in general position. 
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We also consider an extended version of the compatible triangulation prob- 
lem, where the correspondence between the points on the convex hulls (but not 
between the interior points) is prescribed. The status of the extended problem 
is still unresolved, too. 

Conjecture 2 (Extended Conjecture on Compatible Triangulations) 

Conjecture 1 still holds if the correspondence between the extreme points of Si 
and S 2 is given by a cyclically shifted labeling. 

The motivation for stating the extended conjecture stems from the attempt to 
prove Conjecture 1 by induction, which naturally leads to assuming the stronger 
version as a hypothesis. 

In this paper, first steps towards proving the conjectures above are under- 
taken. In fact, obtaining an affirmative answer would be a deep theoretical result, 
showing that all planar point sets of the same size and hull size are ’topologically 
equivalent’ in this sense. The result, if true, gains in importance in view of the 
huge number of inequivalent order types for n points; see Aichholzer et al. [1] 
for the exact numbers of order types for n < 10. 

Section 2 exhibits a family of point sets each of whose members allows for 
a compatible triangulation with any other point set (within the evident size 
and hull restrictions). In Section 3 we prove Conjecture 2 provided the number 
of interior points in either set is three or less. These restricted cases already 
reflect part of the intrinsic complexity of the problem: the orientation of the 
three interior points need to be reversed to allow for a solution in some cases. 
Section 4 shows that compatible triangulability can always be achieved by adding 
at most k — 2 Steiner points, where k denotes the number of interior points. 
This contrasts the afore-mentioned situation for simple polygons [2], where a 
quadratic number of Steiner points is necessary in the worst case, and is a result 
of practical relevance. Finally, Section 5 concludes the paper with some related 
remarks. 

2 A Family of Universal Sets 

A point set U is called weak universal if for every point set S with [S'! = \U\ and 
\CH (S) I = \CH{U) I there is a triangulation of S compatible with a triangulation 
of U. The set U is called universal if, in addition, the correspondence between 
the extreme points of S and U may be prescribed by a cyclic shift. Conjecture 2, 
if true, states that all point sets are universal. Here we prove that certain point 
sets are indeed universal. 

Let [/ be a point set with extreme points Mi, . . . ,u/j (as they appear counter- 
clockwise on the convex hull) and internal points Uh+i, ■ ■ ■ ,Un such that, for 
some 1 < m < h — 2, 

— the subset U \ {ui, . . . , Um} is in convex position, and 

— the internal set {uh+i, • ■ • , Un\ lies to the left of both UhUm and UiUm+T see 

Figure 1. 
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Fig. 1. The point set U. 



We are going to prove that C/ is a universal set. The proof of the result 
requires a preliminary lemma. 

Lemma 1 Let P be a convex polygon with vertices vi,...,Vh in counter- 
clockwise order, and let I be an arbitrary point set interior to P. For any choice 
of indices 1 < i, j < h with i < j — 2 there exists a simple path tt of edges such 
that 

(a) path 7T connects Vi, I, and Vj (in this order) and these are the only vertices 
of it; 

(b) the simple polygon bounded by tt and ViVi+i . . .Vj can be triangulated 
without using an edge that connects two vertices ofir. 

Proof. By induction on d = j — i > 2, the distance between Vi and Vj on P's 
boundary. For d = 2 the result is trivial: sort / U {vi,Vj} radially around Vi+i, 
to produce a path star-shaped as seen from Vi+i, and triangulate by connecting 
all points in I to this vertex. 

Let now d > 2. If the set I is totally to the right of Vi+ 2 i’j then connect Vi 
and Ui _|_2 and slice off P the triangle ViVi+iVi +2 which cannot contain any point 
in I. Apply induction to the remaining convex polygon; the boundary distance 
between Vi and Vj is now d — 1. 

If, on the other hand, there are points of / to the left of Vi-^. 2 Vj then let p be 
the one maximizing the angle ViVi+ip; see Figure 2. Connect p to vt+i and Ui +2 
and observe that the resulting triangle encloses no points of I. Now consider 
the convex polygon Q = pVi +2 ■ ■ - Vj (shaded in Figure 2). Let I' and I" be the 
subsets of / exterior and interior to Q, respectively. By construction, no point 
of /' lies to the right of So, as done for the case d = 2 above, construct 

the path tt' for I' U {vi,p} that is star-shaped as seen from Vi+i, and build the 
corresponding triangulation. Finally, apply induction to the polygon Q whose 
boundary distance between p and Uj is d — 1. This yields a simple path tt" for 
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I” U {p,Vj} whose concatenation with tt' obviously is still simple and satisfies 
the statement of the lemma. EH 




Fig. 2. Constructing the triangulation T{tt). 



Theorem 1 Let U be a set as described above. Then U is a universal set. 

Proof. Let S be any point set with [S'! = |C7| = n and with extreme points 
Vi^ . . . ,Vfi in counter-clockwise order. We construct for CH{S) the path tt from 
Vh to Vm+ii and the associated triangulation T{tt) on tt’s right-hand side, whose 
existence is given by Lemma 1. (Note that the index m is given from U and 
that the counter-clockwise boundary distance between Vh and Vm+i is at least 
two.) The crucial property is that T(tt) can be reproduced on the simple polygon 
Uh . . . UnUm+iUm . . . U\ \ Only edges between non-consecutive vertices of the path 
Uh . ■ . UnUm+i lie outside this polygon, but those correspond to edges outruled by 
the definition of T(7 t). It remains to be observed that any triangulation of CH{S) 
on tt’s left-hand side can be put on top of the convex polygon Um+iUm +2 ■ ■ • Un, 
making compatible the global triangulations. EH 



3 Few Interior Points 

Below we prove Conjecture 2 for point sets with at most three interior points. 
Even in these seemingly simple situations, the illusion of a quick proof is de- 
stroyed by the fact that the orientation of the three interior points might have 
to be reversed to achieve compatibility. We first focus on the following class of 
point sets. 
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Lemma 2 Let S be a point set eontaining an extreme point p such that S \ {p} 
is in convex position. Then Conjecture 2 is true. 

Proof. The set S fulfills the requirements for the set U in Section 2 for m = 1 
and thus is a universal set by Theorem 1. EH 

Theorem 2 Let S\ and S 2 be two n-point sets each of which contains fc < 3 
interior points. Then Conjecture 2 is true. 

Proof. The cases fc = 0, where S'! and S 2 are in convex position, and fc = 1, 
where star-like compatible triangulations always exist, are trivial. 

The case fc = 2 is still easy. It is always possible to find two extreme points of 
Si labeled a and b such that (1) a and b lie on different sides of the line through 
the two interior points of Si, and (2) the (extreme) points of S 2 labeled a and 
b lie on different sides of the line through the two interior points of S 2 . (The 
non-existence of such a pair a, b would imply that one of and S 2 totally lies 
on one side of the respective line.) We build two triangles for Si by joining its 
interior points x and y and connecting a and b to both of them. The remaining 
extreme points can be connected to either x or y accordingly. Doing the same 
for S 2 results in compatible triangulations. 

The case fc = 3 is more intriguing. Let us refer to the triangle formed by 
the three interior points of (or S' 2 ) as the interior triangle of Si (or S 2 ). We 
distinguish between two situations, the former being similar to the case fc = 2. 

(a) Si as well as S '2 contains extreme points with labels a, b, c each of which 
allows for a triangle adjacent to the respective interior triangle such that no 
overlap of these four triangles occurs. In this case, we integrate these triangles 
for both sets. For extreme points with labels intermediate to a, b, and c, edges 
can be drawn now to a unique interior point, in a compatible manner and much 
like the case fc = 2. 

(b) Otherwise. The solution we are going to construct will not contain the 
interior triangle. (In fact, no such solution exists.) Let xyz be this triangle for 
Si. Let Dx be the subset of S'! not on a;’s side of the line through y and z. Define 
subsets Dy and analogously; see Figure 3(a). Since labels as above cannot 
be found, the subsets of S '2 that correspond to D^, Dy, and have to look as 
in Figure 3(b), where two of them are totally contained in the same trilateral 
cell. Without loss of generality, let the third subset correspond to D^. We wish 
to redefine D^, Dy, and D^ so as to be disjoint sets, by introducing ’breakpoint’ 
labels a, b, and c such that D^ ranges from o -I- 1 to &, Dy ranges from 6 -|- 1 to 
c, and Dz ranges from c -I- 1 to a. This is always possible unless two of the sets 
are singletons consisting of the same point p. In this case, however, removal of 
p leaves the set \ {p} in convex position such that compatible triangulations 
exist by Lemma 2. Otherwise, we assign labels x, y, and z to the interior triangle 
of S '2 and draw compatible edges as shown in Figure 4. Note that the orientation 
of xyz has been reversed for S 2 . The two shaded polygons for Si are convex and 
thus can be triangulated compatibly with their counterparts for S 2 . It remains 
to partition the still untriangulated area for Si into convex parts. Possible reflex 
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Fig. 3. Two subsets move into the same cell. 




Fig. 4. Compatible edge skeletons that yield convex parts. 



angles may occur at x or y. If, as in Figure 4, this happens for x then there must 
exist a point labeled d in \ Dy which makes this angle convex. (If d would 
not exist then all extreme points of S 2 would lie on the same side of the line 
through X and z.) The situation for y is similar. This completes the construction 
of compatible triangulations. E3 



4 When Steiner Points Are Allowed 

Since it is still left open whether Conjectures 1 and 2 are true, the question 
of whether compatibility can be always forced by adding a reasonably small 
number of Steiner points suggests itself. We demonstrate below that a linear 
number suffices. An efficient algorithm for finding compatible triangulations is 
implicit in the proof. 
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Let V be an extreme point of the set S. The ear E(y) of v is the triangle 
spanned by the two edges of CH{S) incident to v. An ear is called empty if it 
does not enclose any (interior) point of S. 

Theorem 3 Let Si and S 2 be two n-point sets whose n — k extreme points are 
in fixed correspondence. If k >2 then Si and S 2 can he triangulated compatibly 
by introducing at most k — 2 Steiner points. 

Proof. By induction on k, the number of interior points. Let cr{k) be the number 
of Steiner points needed. We trivially have cr(2) = (t( 3) = 0 by Theorem 2, so 
let us assume fc > 4. 

Let vi, . . . ,Vhhe the extreme points of S'! , and wi, . . . ,Wh the corresponding 
extreme points of S 2 , in counter-clockwise order. We may assume that for no 
index i the ears E{vi) and E(wi) are simultaneously empty. Otherwise, com- 
patible edges Vi-iVi+i and Wi-iWi+i are introduced, and the sets Si \ {uj} and 
S 2 \ {wi} are considered. It remains to discuss the cases below. 

Case (a) There is an index i such that both ears E{vi) and E{wi) are non- 
empty. Let p G S'! be the first point hit when rotating the edge Vt-iVi counter- 
clockwise around Vi-i. As E(vi) contains points, p must be an interior point. 
So by introducing one Steiner point s slightly to the left of Vi-ip and close to 
edge ViVi+i, it is possible to create an empty triangle Vips and three adjacent 
empty triangles as shown in Figure 5(a). Obviously, the same construction can be 
repeated for the ear E{wi) using a suitable Steiner point t, giving four compatible 
triangles. We now apply induction to the sets Si U {s} \ {uj} and S 2 U {t} \ {wj} 
whose number of interior points is /c — 1 > 3. This implies ^{k) < a{k — 1) -I- 1 < 
{k-l)-2+l = k-2. 

Case (b) For each index i, exactly one of the ears E{vi) and E{wi) is empty. 
Clearly Si (and S 2 ) has ft, > 4 extreme points in this case. Let us assume ft > 5 
first. Then one of the sets. Si say, has at least three empty ears. Choose two of 
them which do not overlap, E(vi) and E{vj), for j >i + 2. Consider the convex 
polygon CH{Si) \ (E{vi) U E{vj)), and some triangulation Ti thereof which 
uses only edges which separate E{vi) and E{vj). Let T 2 denote the compatible 
triangulation of the corresponding polygon for S 2 . Now sweep Ti, triangle by 
triangle, with a line starting at position Vi+iVi-i and stopping at position 
Vj-iVj+i. When sweeping a triangle we require ii to pass through 

point Va and to cross the edge VbVb+i. Sweep T 2 with a line £2 in a similar 
manner. Let ai be the current number of interior points of to the left of £i 
(same for 02 )- Since E{vi) and E{vj) are empty ears we have = 0 in the 
beginning and = ft at the end. On the other hand, 02 > 1 in the beginning 
and 02 < ft ~ 1 at the end, because E{wi) and E{wj) are both non-empty. As 
a consequence, there exist indices a and b that yield ai = 02 while £i sweeps 
the triangle VaVbVb+i and £2 sweeps the corresponding triangle WaWbWb+i. Now 
pick a Steiner point s on (this very position of) £i and close to VbVb+i^ and put 
in edges from s to Vb^ Vb+i, and Va] see Figure 5(b). Repeat the construction for 
S 2 which gives a compatible triangle. Finally, apply induction to the respective 
subsets of Si and S 2 on either side of the lines. They have ft' and ft" interior 




Towards Compatible Triangulations 109 



points, respectively, for 1 < fc', k" <k—l. We may assume k' >2 because fc > 4. 
We get cr(fc) < a{k') + cr(fc") + 1, with (r{k') < k' — 2 and u{k") < k" — 1 and 
equality in the last term if k” = 1. In summary, cr(fc) < fc — 2 again. 

The case fc = 4 which has not been covered yet is easier. There is a line ii 
crossing the edges ViV 2 and U3U4 which has exactly 2 interior points of Si to its 
left and fc — 2 > 2 such points to its right. There is also a similar line £2 that 
crosses the corresponding edges for S'2. Pick two Steiner points on £ 1 , one close 
to V 1 V 2 and the other close to V 3 V 4 , and create the two respective triangles, as 
shown in Figure 5(c). Repeat the construction for £2 and S 2 which results in two 
compatible triangles. Now apply induction to the respective subsets of Si and S 2 
on either side of the lines. This gives <j{k) < a{2)+cr{k—2)+2 < (fc— 4)+2 = fc— 2 
which completes the argumentation. EH 





(c) 




Fig. 5. Placing Steiner points. 



5 Remarks 

It is a challenging open problem to prove our conjectures for general point sets. 
Unfortunately, the constructions in the proof of Theorem 2 do not seem to gener- 
alize easily to the case of more interior points. Approaches to prove Conjecture 2 
by induction suffer from the lack of appropriate methods for splitting the prob- 
lem. Here is a discouraging related fact: given two quadrilaterals, one convex and 
the other star-shaped, plus one interior point for each, it may not be possible 
to triangulate them compatibly if the correspondence between their boundary 
vertices is fixed by a cyclic shift. 

We tested Conjecture 2 for small point set sizes, utilizing a database of com- 
binatorially inequivalent point sets (i.e., order types); see [1]. If two point sets 
exhibit the same order type then every triangulation of one set also leads to a 
compatible triangulation for the other. By checking all different pairs of order 
types we could verify the conjecture for up to 8 points. Although our database 
provides all order types of size up to 10, exhaustive tests turned out to be time 
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consuming and are still incomplete for size 9. In fact, determining the time 
complexity of computing compatible triangulations, if they exist, for two given 
n-point sets is an open problem. 

Finally, we pose the problem of triangulating compatibly two point sets in 
fixed correspondence when Steiner points are allowed. In view of the applications 
mentioned in [7] , a fast algorithm using a small number of Steiner points would 
be of practical relevance. 
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Abstract. Let C be the convex-hull of a set of points S with integral 
coordinates in the plane. It is well-known that \C\ < for some 

constant c where D is the diameter of S: i.e. the maximum distance 
between any pair of points in S. It has been shown that c = 7.559.. 
for an arbitrary S, and c = 3.496.. in the special case when S' is a 
ball centered at the origin in the plane. In this paper we show that 
c = 12/-^ 47 t 2 = 3.524.. is sufficient for an arbitrary set of lattice points 
S of diameter D in the plane, and \C\ ~ 12 ^2/(9-7r2) = 3.388..D^^® 

is achieved asymptotically. Our proof is based on the construction of a 
special set in first quadrant, and the analysis of the result involves the 
calculation of the average order of certain number-theoretical functions 
associated with the Euler totient function 4>{n). 



1 Introduction 

A lattice point is a point with integral coordinates. Given a set S of lattice points 
in the plane, let C be the convex-hull of S, and denote the number of extreme 
points in C by |C|. The behavior of |C| as a function of parameters associated 
with S has been studied in various contexts in computational geometry, computer 
graphics [10,6,9], and integer programming [1,3,5]. 

Andrews’ general theorem [1] on convex bodies in d-dimensional space implies 
that in the plane |C| is bounded by 

|C| < c£|2/3 (1) 

for some constant c, where D is the diameter of S\ i.e. the maximum distance 
between any two points in S. Another proof of this bound for the plane was 
given by Katz and Volper [9]. The constant c has also been studied. Balog and 
Barany [2] showed that when S is the ball of radius r = D/2 centered at the 
origin, i.e. S = rB^, then 

0.207.. < |C| < 3.496. 

* Supported in part by NSF Grant No. CCR-9821038, and a UCSB-COR faculty 
research grant. 
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Therefore one can take c = 3.496.. in the special case when S' is a ball of diameter 
D. A d-dimensional analysis for the ball appears in [4]. Har-Peled [7] showed 
that the value c = = 7.559.. is sufficient for the bound (1) in the plane for 

arbitrary S. 

In this paper we investigate further the nature of the constant c in (1) and 
show that 

c = 12/v^ 47 t2 = 3.524.. 

suffices for an arbitrary S of diameter D in the plane. Our proof is based on the 
construction of a special set in first quadrant of the plane which satisfies certain 
constraints. The construction involves selecting a set of fractions in a particular 
order as slopes of the line segments of the convex-hull. Based on the properties 
of the average order of certain number-theoretical functions associated with the 
Euler totient function 4>{n), we derive an upper bound on the size of the set 
constructed. This leads to an improved value for c. We also show that using the 
construction idea of the proof, we can always create a convex-hull C with a given 
diameter D such that 

\C\ ~ 12^2/(97t2) = 3.388. (2) 

is achieved asymptotically. 

The organization of this paper is as follows. Section 2 gives the number- 
theoretical background we require, and includes the proof of the main theorem, 
which we then use in Section 3 for proving our result on the improved value of c. 
In Section 4, we construct large convex-hulls with a given diameter proving (2). 



2 Number-Theoretical Definitions and Results 



We use the classical book by Hardy and Wright [8] as our main reference for the 
definitions and basic results used in this section. We denote by (r, s) the greatest 
common divisor of r and s. 



• The Euler totient function 4>{n) ([8], p. 52) is defined as follows: 

1 . 0 ( 1 ) = 1 

2. for n > 1, 0(n) is the number of positive integers less than n and rela- 
tively prime to n. 

• The Mobius function ^{n) ([8], p. 234) is defined by 

1. /x(l) = l, 

2. = 0 if n has a square factor, 

3. ^{pip 2 ■ ■ -Pk) = (~1)^ if all the primes pi,P 2 , ■ ■ ■ ,Pk are different. 

• The Riemann zeta function <C(s) ([8], p. 245) is defined for s > 1 by 



c(») = E 

n=l 



1 



We use the following well-known results: 




An Improved Upper Bound on the Size of Planar Convex-Hulls 



113 



Lemma 1. 



d\n 

li{d) 6 



c(2)-'=i: 



d=l 



(P TT^ 



(3) 

(4) 



Proof. The proof of (3) is in ([81, p. 235). The proof of (4) can be found in ([81, 
Thm. 287, p. 250; Thm. 293, p. 251). 

Let t^o(ti) = <('(1) + ^(2) + • • • + 4i{n). It is known in relation to Farey fractions 
([8[, Thm. 330, p. 268) that 



3rP 

<Po{n) = + O(nlogn), 

7 T^ 



(5) 



so that the average order (([8[, p. 263) of the function 4>{n) is given by (5). 
Theorem 1 below gives an expression for the average order of the function rf <j)(ri) 
for r > 0, generalizing (5). 

Theorem 1. For any integer r > 0 



<p,(n) = r-f>(l) + 2^-f>(2) + - ■ •+n’'-^(n) = 
Proof. By (3) of lemma 1 we have 



6 



(r -I- 2)7 



j^r-H^_l_Q(^r-|-r log j^) _ ^0^ 



<p^(n) = nP (j){m) = Y = Y ^ nf d' g.{d) 



m—1 



ra—1 d\r 



m—1 dd'—m 



[n/dj 



= E E rf’'W’'+V(rf)= E d^dr+^,a{d) = Yd^F{d) E 

m—1 dd'—m dd'<n d—1 d' — l 

Using the fact that 



E '=’■” = 



n 



r+2 



fc=l 



r -I- 2 



+ 0(n"+^). 



we get 



Therefore 



[n/dj 



2 : (“r*' = 7^ ( 



1 / n \’’+2 



d' = l 



d) 



O 



/ \ 

■ 



<Pr{n) = 



^r+2 






r + 2 ^ d? 
d=l 



^r-l-2 



E#+ 0 (n-^ E ;^l+ 0 (n-^logn) 



r -I- 2 ^ d? 

d=l 



d—n-\-l 



.r+2 



r -I- 2 



C(2)-i -h 0(n’'+i) -h 0(n’'+i log n) , 
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and the theorem follows from the last equality and (4) of lemma 1. 

We need the following result relating (l>o{n) and as a step in our study 

of |C| . 

Theorem 2. Let <Po{n) = ^(1) + 4>(2) + • • • + 4>(n), 

<Pi{n) = 1 • </>(!) + 2 • <j>{2) H h n • 

Then 

<Po{n) ~ ^ = 0.8810516..^>i(n)2/3. (7) 

V dTT^ 

Proof. The theorem follows by combining the expressions for <Po(n) and <Ti(n) 
obtained as the cases r = 0, and r = 1 of theorem 1. 

We note that also the magnitude of the error term in (7) can be calculated 
by using the full expressions for and <Ti{n) obtained. This gives 

Ti{nf/^ + oUi{nf/Hog<Pi{n)) . 

We omit the details of this calculation. 

3 An Improved Upper Bound 

We first establish an upper bound on the size of the convex-hull of lattice points 
in first quadrant of the plane in the following lemma. 

Lemma 2. Let Sa,b be a set of lattice points in first quadrant of the xy-plane 
enclosed by y = a and x = b. For the convex-hull Ca^b of Sa^b> there holds 

\Ca,b\ < 3 T^-^ 6)^/^ log (a -b b)j 

Proof. Without loss of generality, Sa,b includes the points (0,a) and (6,0) and 
both a and 6 are positive integers (Figure 1). Instead of the point-set Ca,b we con- 
sider the corresponding set of slopes Ra,b of the line segments (edges) connecting 
the consecutive extreme points of the convex-hull. Clearly, \Ca,b\ = \T^a,b \ + 1, so 
we can alternately study the properties of the set TZa,b- By convexity, the slopes 
of the edges of the convex-hull are all different. Furthermore Sa,b is a set of 
lattice points, and therefore the slopes of the non-vertical edges are all rational 
numbers. 

Let be an optimal set for the following problem: maximize |77.| subject 

to 

'^y + '^x = '^y + x < a + b (8) 

|g 7?, fGT?, 

Clearly, for any a and 6, \R-a,b\ < \T^*a+b\- We will find a bound for . 

Let Qi for z > 0 be the set of slopes defined as follows: 
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a 




A 



Fig. 1. A set of lattice points Sa,b in first quadrant. 



1. Qo = 0, 

2. Qi = {f, g}. We assume that the fraction ^ is defined as a slope and it 
represents the vertical edge whose length is one unit. 

3. For t > 1, Qi = {| \ y + x = i and (y,x) = l}. 

Table 1 illustrates the first few values Qi, ^o(*)> and The following 
properties can easily be seen for t > 1: 

\Qt\ = 

^ y + x = i \Qi\ 

|6Ci 

Let n -I- 1 be the smallest number such that Yl7=i *I2*I > a + b, and Q^+i be 
an arbitrary subset of Q„+i such that 

|2n-|-ll = 

Then consider the following set 

Ti-a+b ~ Qn+i- (11) 

We claim that TZa+b is a maximal set which satisfies the constraint (8). The 
expression (11) describes a greedy construction : To include in set TZa+b, select 
a fraction whose numerator-denominator sum is the smallest. Continue includ- 
ing fractions until the sum of all the numerators and the denominators of the 
fractions currently in the set exceeds a + b . Since TZa+b is a set, no fraction 
can be included in the set more than once, and since the slopes are all different, 
among the equivalent fractions, the irreducible one has the smallest numerator- 
denominator sum. This explains why only the relatively prime numbers are to 
be considered in the construction. 



ja + b) 

n + 1 



(9) 

(10) 
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Table 1. Table of Qi, (j){i), ^o(*)> and ^i{i) for i = 1, 2, . . . , 7. 



i 


Qi 




$o{i) 


<^i(j) 


1 


{t4} 


1 


1 


1 


2 


{ii 


1 


2 


3 


3 


|i 2 | 

1 2 ’ 1 J 


2 


4 


9 


4 


|i 3| 
l3’ 1 J 


2 


6 


17 


5 


r 1 2 3 44 
1 4 ’ 3 ’ 2 ’ 1 J 


4 


10 


37 


6 


|i 54 
1 5 ’ 1 J 


2 


12 


49 


7 


r 1 2 3 4 5 61 
l6’5’4’3’2’lJ 


6 


18 


91 



From (11) we have 

n n+1 

U 2* ^ ^a+6 C y Q, (12) 

Using (12) and (9) we find that 

n n+1 

^+i) + 1 < |7^a+^.| < ^+^) + 1 (13) 

i=l i=l 

Again using (12) and (9) together with the expression (10), and using the fact 
that by construction n + 1 is the smallest number such that Yl7=i *I2*I > a + b 
we get 



n n+1 

+ 1 < a + 6 < ^ +(i) + 1 (14) 

i=l i=l 

We note that 4>{n+ 1) = 0(<?i(n)^/^) using the expression <Pi{n) obtained 
from (6) with r = 1. Therefore we can write the following upper bound for |77.o+b| 
using (13): 

n 

|7^„+b| = ^ +t) + O (+(n)i/3) = <Po{n) + O (+(+i/3) 

Furthermore from Theorem 2 

\TZa+b\ = -3^|=<?i(n)2/3 + o(<?i(n)i/+og<l>i(n)) +0(<Z>i(n)i/3) , (15) 

and from (14) we obtain that <?i(n) ~ a + 6. Therefore 

\T^a+b\ = (i7=r + O ((a + 6)^/3 log (a + 6)) 

which proves the bound for \Ca,b\ of the lemma. 
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Theorem 3. For the convex-hull C of a set S of lattice points of diameter D in 
the plane, there holds 

Proof. S can be enclosed by an axis-parallel rectangle whose sides are at most 
D in length. Using this rectangle, we can partition S into four parts such that 
each part is confined to a single quadrant as shown in Figure 2. For the size of 
C we have 

\C\ < |Cai, 6 i| + 1^02,62! + 1^03.63! + \Cai,bi\ 

By Lemma 2 we can rewrite the inequality as 

\C\ < ^^J=((ai-h6i)^/^+(a2+62)^^^+(a3+63)^/^+(a4+64)^^^)+0(L>^/^logL>) 
V 47 t^ 

(16) 







Fig. 2. Partitions of S in quadrants. 

The function f{x) = x^^^ is concave on [0, 00 ). In particular for nonnegative 
xi and X 2 , 

^^2/3 ^ ^^2/3 ^ 2 

Therefore, the equality in (16) becomes 

O log 

Since ai -I- 62, 03 -I- 64, 02 + 63, and 04 -I- bi are all smaller than or equal to D, 
the inequality for \C\ of the theorem holds. 



|C|< 



^ 47T^ 



(ll + 61 + Ct2 + ^2 + 03 + ^3 + 04 + ^4 



2/3 
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Fig. 3. n = 7, a = 5 = 46 = 5^1 (7) + The dotted line is used for the convex- 
hull of the set of lattice points in the ball of radius 46 in first quadrant. The 
solid lines show the constructed hull C(7). 



4 Constructing a Large Convex-Hull 
with a Given Diameter 

To investigate how tight the new value of c in the upper bound for \C\ is, we 
revisit the proof of Lemma 2. We first describe how we can actually construct a 
convex-hull Ca+b from the set of slopes TZa+b created in the proof for particular 
a and b. More precisely, for every n, we construct a convex-hull C(n) in first 
quadrant with a = b = ^^i(n) -I- symmetric about the line y = x. As its set 
of slopes, we take 

n 

T^in) = U Q^. 

i=l 

C(n) connects the point (0, -I- on the y-axis to (|^i(n) -I- |,0) on 

the x-axis and consists of ^o(n) edges. We start with the point (0, -I- |) 

as the first vertex, and if the i-th smallest fraction in TZ{n) is yi/xi, then we 
place the next vertex on the hull Xi units to the right and yi units down from 
the current vertex. See Figure 4 for the n = 5 case. 

Now consider the C(n) constructed as we described. If it includes a vertex 
whose distance from the origin is larger than then the diameter of 

the set constructed by taking four copies of C{n) around the origin will have 
diameter larger than D. That this is indeed the case can be seen from Figure 
3, where the point farthest from the origin in our construction is not inside the 
ball with radius 46. Thus the constant c = 12 / ^ = 3.352.. in the upper 

bound is not tight. 

We next investigate how close we can come to this value. It follows by sym- 
metry and convexity that the farthest vertex from the origin on C(n) constructed 
is one of the endpoints of the edge with slope 1 on the hull as shown in Figure 
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Fig. 4 . n = 5, a{n) = b{n) = + ^ = 19. 



4. Denote by h{n) the a;-coordinate of this point, and by r(n) its distance from 
the origin. By construction 

n j 

Hn) = JxKbj) = l]x[* < j] = = ^X[i + j <n] (17) 

where y is the indicator function of its argument, i.e., it is 1 if its argument 
evaluates to true, and 0 otherwise. 

We next show that h{n) ~ |^i(n). Since intercepts of the hull are a(n) = 
6(n) = l<?i(n) -I- 1 this implies h{n) ~ |a(n), and we can calculate r{n) by the 
Pythagorean formula as 

r{n) ~ ^a(n). (18) 

Theorem 4. Suppose h{n) is as defined in (17), and <?i(n) as defined in The- 
orem 2. Then h{n) = |<?i(n) -I- O(n^logn). 

Proof. We will indicate the derivation of the asymptotic part and ignore the 
calculation of the error terms. First we write h(n) = hi(n) h^fn) where in 
h\{n), the index j runs from 1 to n/2, and in /i 2 > j runs from nj2-\- 1 to n. Then 

hi{n) = '^.j'^xiihj) = 1 ], 

j=i i=i 

since for the indices in question i < j and we automatically have i j < n. 
Therefore hi{n) = On the other hand, if j > n/2, then n — j < j and 

n j 

h 2 {n) = iX!x[(bi) = l]x[* <n-j] 

*=i 
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n 

~ n (<l>o(n) - <^o(|)) - (<?iN - ‘^i(|)) • 



Using the expressions ^l>o{n) ~ and 'Pi{n) ^ 2v? j-K^ from Theorem 2, 

and adding the terms for hi(n) and h 2 {n), we obtain 



h{n) 



(I ^ 3 „ 1\ 3n3 



3 

8 



^i(n) 



as claimed. 



Theorem 5. For D large, there exists a set of lattice points in the plane with 
diameter D and convex-hull C such that 

\C\ ~ 12 L)2/3 = 3.388. 

Proof. If we set a = b = D where D is of the form <Fi{n), and use the 
construction for C{n), then r ~ Z?/2 by (18). By Lemma 2 

Since identical convex-hulls are constructed in all four quadrants to obtain C 
from C(n), the diameter of C is D and \C\ ~ 4|C(n)|. This proves the claim of 
the theorem when combined with (19). 
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Abstract. In this paper we study the following planar two-watchtower 
problem: given a terrain (A-monotone chain) T with n vertices, locate 
two vertical segments (watchtowers) uv and u'v' which have to be erected 
on T such that every point on T is visible to the top of either watchtowers 
(u or u') and the maximum height of uv, u'v' is minimized. We present 
an 0{n^) time algorithm to solve the discrete version of this problem 
when V, v' must be on some vertices of T. Under a mild condition on 
solving a special cubic equation with three bounded variables in 0[fz) 
time we can also generalize the algorithm to solve the general problem 
in 0{n^ time. Using parametric search, the discrete problem can 

be solved in 0(n® log^ n) time and the general problem can be solved in 
log^ n) time. 



1 Introduction 

Given a polyhedral terrain (mountain landscape), the shortest watchtower prob- 
lem is to compute the shortest vertical segment uv erected on the terrain such 
that every point on the terrain can be seen from the top of uv, say u. The prob- 
lem arises in geographical information systems, military surveillance, etc. The 
problem was first studied by Sharir [Sh88], who presented an 0(nlog^ n) time al- 
gorithm (where n is the size of the terrain) . The algorithm was later improved by 
Zhu to 0(n log n) time [Zh97]. On the related problems of guarding polyhedral 
terrains, readers are referred to [BDEG94,BSTZ97,O’R97,WZ00,Zh94]. 

A very natural generalization of the problem is to build a set of watchtowers 
which can collectively guard the terrain. The objective is certainly to minimize 
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the maximum (or sum) of the heights of the watchtowers. This general problem 
(in 3D) is NP-hard as the problem of deciding whether a set of points (which 
can be thought of as watchtowers with zero height) can collectively guard a 
given terrain is NP-hard [CS89]. Eidenbenz et al. proved that positioning the 
minimum number of watchtowers with fixed height (to cover a 3D terrain) is 
not only NP-hard, but also NP-hard to approximate within a ratio of Clogn, 
where C is a constant [ESW98]. However, if the number of towers k{k > 1) is 
fixed then almost nothing is known, yet the problem seems to be difficult even 
for fc = 2 in 3D. In this paper we first study the corresponding problem in 2D, 
i.e., given a terrain in 2D, compute two watchtowers uiVi and U 2 V 2 (with heights 
|uiUi| and |m 2 ^^ 2 | respectively) such that the terrain can be collectively guarded 
by the two watchtowers and the maximum of {|uiui|, IM 2 U 2 I} is minimized. 

Regarding guarding a 2D terrain (monotone chain), Chen et al. proved that 
it is NP-hard to compute the minimum number of guards (which can only lie 
on the 2D terrain) to cover the whole terrain [CEU95]. If the guards are all at 
fixed height above the 2D terrain, then Nilsson’s result implies a polynomial 
time solution to find the minimum number of guards [Ni94]. Our problem (even 
in 2D) is new and never been studied before, to the best of our knowledge. 

We first present some basic definitions related to this paper. Given a point p 
in the plane, let x{p),y{p) be the a;, j/-coordinate of p respectively. A polygonal 
chain < ui,...,u„ > is said to be X-monotone if x{vi) > x{vj) implies i > j, 
for all i and j between 1 and n. A terrain T in 2D is a X-monotone chain with 
vertices ui, ...,u„. Given two points a,b on or above T, a,b are visible to each 
other (or a, b covers each other) if the segment ab does not intersect any edge of 
T. The visibility region of e G T from a, is the set of all points of e G T which 
are visible to a. These definitions can all be found in [PS85,0’R87,0’R94]. 

2 Algorithm 

In this section, we present implementable polynomial time algorithms for the 
planar 2-watchtower problem. We also consider the more theoretical solutions 
using parametric search. We first consider a discrete version of the problem, 
namely, when the base (bottom vertex) of each of the two watchtowers must be 
at a vertex of T. The following property regarding 2D visibility is easy to prove. 

Lemma 1. Given any point u above T, the non-empty visibility region of any 
edge uiW+T from u is connected; moreover, if x{u) < x(vi) then the non-empty 
visibility region contains Vi+i and if x{u) > x{vi+i) then the non-empty visibility 
region contains Vi. 



2.1 Algorithm for the Discrete Planar 2-Watchtower Problem 

We begin with some definitions. Given a point p on T, we call the minimum 
height watchtower located at Vi whose top can see p the critical watchtower of 
p and Vi. We say that the height of this critical watchtower is the critical height 
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Fig. 1. The definition of critical watchtower. 



of p and Vi and denote it by h(j>, Vi). Clearly h{p, Vi) is the minimum height of a 
watchtower one can build which is located at vt and its top can see p. In Figure 
1 , UiVi is the critical watchtower of p and Vi and its height \uiVi\ is exactly 
h{p,Vi). Similarly we define the critical watchtower of an edge e of T and Vi as 
the minimum height watchtower located at Vi whose top can see every point of 
e and we denote the critical height of this tower by h{e,Vi). In Figure I, M 2 W is 
the critical watchtower of 64 = V 4 V 5 and Vi, and h{e 4 ,Vi) = \u 2 Vi\. (Notice that 
the concept of critical watchtower can be easily generalized to arbitrary point v 
on T.) In the following we first prove some fundamental properties (a necessary 
condition) of the discrete planar 2 -watchtower problem. 

Lemma 2. When i,j are fixed, if UiVi (ujvf) achieves the optimal solution for 
the discrete 2-watchtower problem then Ui,uj collectively see T and there exists 
a point q on T such that h{q,Vj) > h{q,Vi) (h{q,Vi) > h{q,Vj)). 

Based on this lemma, we have the following algorithm to solve the discrete 2- 
watchtower problem. Without loss of generality, we assume that i,j are fixed for 
now. We have to consider two cases: (1) is a vertex of T, q^ v\,q^ and (2) 
g is a point on some edge of S. Notice that in case 1, following Lemma 1, one of 
the edges incident to q must be completely seen by the optimal watchtower UiVi 
{ujVj). Moreover, if q = Vd, d < i, then the edge Vd-iVd must be completely seen 
by Ui- Symmetrically, if q = Vd, d > i, then the edge VdVd+i must be completely 
seen by Ui. Therefore, to handle case 1 we do the following: for each edge e we 
compute h{e,Vi) and h{e,Vj), if h{e,Vj) > h{e,Vi) we build two towers at Vi,Vj 
with height h(e,Vi), otherwise we build two towers at Vi,Vj with height h{e,Vj). 
Then we check whether the tops of the two towers, Ui and Uj, can collectively 
see T. For a fixed e, this can be done in linear time: just compute the visibility 
region of T from Ui,Uj and then check whether every edge can be collectively 
covered by ut and Uj . This can be done in linear time using the standard visibility 
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computation subroutines [EA81,Le83,Jo90,JS87,O’R87]. We record all pairs of 
the towers when their tops can collectively cover T. Among them we pick up the 
pair with the minimum height as Hi(T, i,j). Clearly Hi(T, i,j) can be computed 
in 0{n^) time and 0{n) space as there are 0{n) edges in T. We remark that 
h{e,Vi) over all e can be pre-computed in linear time with the shortest path 
map rooted at Vi [GHLST87], but that will not affect the overall complexity for 
computing j). 

For case 2, where a point in the middle of some edge determines the optimal 
solution for the discrete two-watchtower problem, we do the following. We fix an 
edge e and assume that q must be on e (eventually we need to try on all such 
e). We first prove the following lemma. 

Lemma 3. When i,j are fixed, in case 2, q cannot he to the left (right) of both 
Vi and Vj. In other words, e must be between Vi and Vj. 

Now we show how to handle case 2. Without loss of generality, we assume 
x{vi) < x{vj) and e = VkVk+i is between Vi and Vj. We first compute the 
shortest paths from Vk to Vi and from Vk+i to Vj. Let the two paths be < pi = 
Vk,P 2 , ■■■,Ph = Vi > and < qi = Vk+i,q 2 , q-m = Vj > respectively (Figure 2). 

Notice that if we extend the edges PsPs+i{s = — 2), they naturally 

divide e into 0{h) red intervals and at the same time the intersections of these 
extensions with the vertical line through Vi determines the heights of the towers 
located at Vi. We can define a function Fj.{q) for each point g on e such that it 
is the height of the tower located at Vi and the top of the tower can see q, but 
nothing to the left of it. Clearly Fr{q) is piecewise decreasing function (containing 
0{h) pieces). Symmetrically, we can extend the edges qtqt+i{t = 1, ...,m — 2), 
they also naturally divide e into 0(m) blue intervals. We can also define a 
function Ft{q) for each point q on e such that it is the height of the tower located 




Fig. 2. Handling case 2 for the discrete two-watchtower problem. 
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at Vj and the top of the tower located at Vj can see q, but nothing to the right of 
it. Again Fb{q) is a piecewise increasing function (containing 0{m) pieces). {In 
other words, Fr{q), Fb{q) are exactly h{q,Vi),h{q,Vj) respectively.) Let {X,Y) be 
the coordinate of q, {x,y) be the coordinate of Ui and let Ps = {x{ps),y{ps)) be 
the vertex of T tangent to ulq. The height of upUi is the difference between y 
and y{vi), which is 

{x{vi) - x{ps))^ — + y{Ps) - y{vi). 

X - x{Ps) 

As F is a linear function of X, Fr{q) (and Fb{q)) is of the form 

X — 02 

ai— h 04, 

X -03 

where X represents the a;-coordinate of q which is bounded by x{vk) and x{vk+i), 
and 01 , 02 , 03,04 are constants related to the local geometry. 

Now the problem is to find the intersection of the two functions Fr{) and Fb{) 
at some optimal point q*{e,i,j) (with respect to e). (Geometrically, q*{e,i,j) is 
the point such that if one moves it by any small distance along e, the maximum 
height of the two towers whose top must both see q*{e,i,j) will be increased.) 
What we do is to first merge the red/blue interval points into a sorted list 
(which can be done in 0{h + m) time), then we use prune and search to locate 
the interval which contains the intersection of Fr{) and Fb{). This can be done 
in 0{h + m) = 0{n) time: we first choose the ( ^+™ J_th interval point q', if 
Fr{q') > Pb{l') then we throw away all the interval points to the left of q'] if 
Fr{q') < Fb(q') then we throw away all the interval points to the right of q'; 
if Fr{q') = Fb{q') then we are done. This procedure continues until either we 
have Fr(q') = Ft{q') (in this case q' = q*{e,i,j)) or we have only two sub- 
intervals left, one red and one blue. At this stage we simply solve an equation 
Fr{q) = Fb{q) to obtain q*{e,i,j), which can be done in 0(1) time as this is a 
1-variable quadratic equation. To evaluate Fr{q') and Fb{q') in 0(1) time, we 
just keep track of, for each interval point q' on e, the previous and successive 
red and blue interval points. This can be easily done when we merge the red 
and blue interval points. Therefore, the time it takes to compute the optimal 
solution when both i, j and e are fixed is 

0(n) -I- 0{nj2) + 0(n/4) -|- ... + 0(1) = 0(n). 

It remains to show that the tops of the two towers can collectively cover T. 
This can be done in a similar fashion as in case 1. We just need to compute the 
visibility region of (every edge of) T from Ui,Uj and then check whether each 
edge is collectively covered by Ui and Uj. This can be done in 0(n) time. If Ui, Uj 
cannot collectively cover T, we simply throw them away as valid candidates. As 
we might have 0(n) edges between Vi,Vj we have to try all of them and record 
all the pair of towers whose tops collectively cover T. Among those recorded 
pairs we pick up the one whose height is the shortest as H2{T,i,j). Therefore 
we have the following lemma. 
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Lemma 4. Wheni,j are fixed, the optimal solution for the discrete 2-watchtower 
problem can he obtained in 0{n^) time and 0{n) space. 

With Lemma 4, it is straightforward to have the following theorem as we 
have O(n^) number of combinations of i,j. 

Theorem 1. The optimal solution for the discrete 2-watchtower problem can be 
obtained in O(n^) time and 0(n) space. 

Finally, we show that with parametric search [Me83] the discrete 2-watchtower 
problem can be solved in O(n^log^n) time. The height h of two towers is a 
parameter of the decision problem which asks whether there exist a pair of ver- 
tices Vi and Vj of the terrain T such that every point of the terrain is visible 
from towers of height h located at Vi and Vj. We need two algorithms, sequen- 
tial and parallel, for the decision problem. Let Tg be the running time of the 
sequential decision algorithm. (From the previous discussion, it is easy to see 
Tg = 0{n^).) Let Tp and P denote the running time and the number of pro- 
cessors of the parallel algorithm respectively. The parametric searching scheme 
allows us to solve the optimization problem, i.e., the planar discrete 2-watchtower 
problem, in 0{PTp TpTglogP) time. We use a parallel algorithm of Atallah 
et al. [ACW91] for computing visibility in a simple polygon. Their algorithm is 
based on a divide-and-conquer technique and can be implemented in the weak 
parallel model of computation of Valiant [Va75]. The running time is O(logn) 
using 0{n/ logn) processors. We just need a simplified version of the algorithm 
with O(logn) running time and 0(n) processors. We apply the algorithm in 
parallel for all pairs of Vi,Vj using O(n^) processors. Hence Tp = O(logn) and 
P = O(n^). 

Theorem 2. With parametric search, the optimal solution for the discrete 2- 
watchtower problem can he obtained in O(n^log^n) time. 

Notice that so far it is not possible to actually implement an algorithm using 
parametric search. Therefore, this result is only of theoretical meaning. 

2.2 Algorithm for the Planar 2-Watchtower Problem 

We now discuss the solutions for the planar 2-watchtower problem. Our idea 
is basically the same as that for the discrete case. Again the following lemma, 
which presents a necessary condition for the planar two-watchtower problem, is 
easy to prove. 

Lemma 5. Ifuv (u'v' ) achieves the optimal solution for the planar 2-watchtower 
problem then u, u' collectively see T and there exists a point q on S such that 
h{q,v') > h{q,v) (h{q,v) > h{q,v')). 

Similar to the discrete case, we also have two cases when g is a vertex of T 
and q is on some edge of T. However, for both cases v and v' do not necessarily 
locate on a vertex of T anymore. Let L be the set of lines which are the extensions 
of edges of T. We compute the arrangement of L, A{L). We have the following 
lemma. 
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Lemma 6. In case 1, an optimal solution for the planar 2-watchtower uv has 
the property that either v is a vertex ofT or u is a vertex of A{L). 

Proof. It suffices to prove the second part of the lemma. Assume that u*v* 
is the optimal solution for the planar 2-watchtower problem of T such that v* is 
not a vertex of T and u* is not a vertex of A{L). We show below how to construct 
Wv such that every point of T which is covered only by u* is also covered by u 
and 1^^! < 

By the definition of the problem, under case 1, u* must be on a line I* of 
L and v* is on an edge e of T (Figure 3). If I* and e are parallel then we can 
slide u*v* to either a vertex of e or the closest vertex of A{L) on /*, which 
is defined by I* and l~^ such that some points on T seen by are not seen 
by the top of the other watchtower. Let the resulting watchtower be uv, then 
\uv\ = |M*r!*| and either w is a vertex of T or m is a vertex of the arrangement 
A{L). 




Fig. 3. Illustration for Lemma 6. 



If I* and e are not parallel then we can slide u*v* (this time assume that 
we can always squeeze it) to either a vertex of e or the closest vertex u~^ of 
A{L) on I*, which is again defined by I* and l~^ such that some points on T seen 
by u~^ are not seen by the top of the other watchtower and the height 
becomes shorter in the process of sliding. Let the resulting watchtower be uv, 
then |uv| < 1^*^* | and either is a vertex of T or rt is a vertex of the arrangement 
A{L). □ 

The proof of Lemma 6 naturally gives us an algorithm to solve the problem 
under case 1 in 0{n^) time, by converting the solutions of the 0{n^) discrete 
2-watchtower problems, where the bases of the towers are on some vertices of T, 
to the general problem. We thus have the following lemma. 
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Lemma 7 . In case 1, the optimal solution for the planar 2-watchtower problem 
can be obtained in O(n^) time and 0{n^) space. 




Fig. 4 . Handling case 2 for the general two-watchtower problem. 



Case 2 is similar to the one we handle in the previous subsection. The differ- 
ence is that now v,v' can be on some edges of T. Let v £ e' ,v' G q £ e and 
without loss of generality let e' = ViVi+i, e” = vjVj+i and e = VkVk+i (Figure 4). 
Now the function which defines the critical height h(q,v), Fr(e' ,q), where q£ e 
and V £ e' , is a piecewise decreasing function when v is fixed. Let q = {Xi, Yi) 
and u = (X2,Y2). Fr{e,q) is of the form 



ai (X2 — 02 ) — + 05X2 + ae,bi < X\ < 62 7 ^3 ^ X2 <647 (^) 

Al — 0-4 

where 0^(1 < i < 6),bj{l < j < 4) are constants related to local geometry. 
Similarly, the function Fb{e”,q), where q £ e and v' G e”, defines the critical 
height h{q,v'). It is a piecewise increasing function when v' is fixed and of the 
same form as (A). Notice that once X\ is fixed we can easily maximize and 
minimize the value of (A) in 0(1) time. This evaluation will be used in the 
prune and search step. Similar to the previous section, the prune and search 
step stops when we have obtained an optimal interval point or we have located 
a red and a blue intervals such that Fr{e,q), Fb{e,q) overlap (i.e., within the 
intersection of these two intervals the maximum of Fr{e,q) is larger than the 
minimum of Fb{e, q) and vice versa). 

Finally, if we assume that the following equation with variables Ai, A 2 , A 3 , 



dl{X2 



02 ) 



Al — tt3 
Al — 04 



+ 0^5X2 Oq 



a'AXs 



a'2) 



Xi - a'3 
Ai-< 



+ 0.5^3 + O-Qy 



~{B) 
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where at, a'(l < z < 6) are constants and b\ < X\ < 62, < X2 < 64, b'^ < X3 < 

64 for constants 61, &2, &3, ^4, ^3, ^4, can be solved in /s time (which can be done 
numerically in practice), then the merging of red/blue list, the subsequent prune 
and search step and the checking of whether u, u' can collectively cover T will all 
be done in 0 {n + f^) time with e, e', e” fixed. As we have 0 {n) candidates for e 
and O(n^) number of combinations of e', e”, case 2 can be solved in 0(n‘* + 74^/3) 
time. We hence have the following lemma. 

Lemma 8. In case 2 , if we assume that the above cubic equation (B) can be 
solved in /a time then the planar 2-watchtower problem can be solved in 0{n"^ + 
n^fs) time and 0{n) space. 

Therefore we have the following theorem. 

Theorem 3. Assuming that the above cubic equation (B) can be solved in (ap- 
proximately) /s time then the optimal solution for the planar 2-watchtower prob- 
lem can be obtained in 0{n'^ + n^/3) time and O(n^) space. 

Finally, we show that with parametric search the planar 2-watchtower prob- 
lem can be solved in 0(rz^ log^ n) time. The height H of two towers is a parameter 
of the decision problem which asks whether there exist a pair of towers sliding 
on e' and e” of the terrain T such that every point of the terrain is visible from 
towers of height H located on e' and e”. Let Tg be the running time of the 
sequential decision algorithm. It is easy to see that Tg = O(rz^): for each triple 
e, e',e”, there are 0(n) event points (endpoints of the red/blue intervals) and 
checking whether the terrain T is completely visible can be done in 0 (n) time. 
Let Tp and P denote the running time and the number of processors of the par- 
allel algorithm respectively. It is easy to see that Tp = O(logn) and P = 0(n‘*): 
we use a processor for each e, e', e” and an event point on e (this uses a total of 
0{n‘^) processors); moreover, with these processors it is easy to check in parallel 
whether T can be completely covered in O(logn) time. Hence the total running 
time is O(n^log^n). (Again, this result is only of theoretical meaning.) 

Theorem 4. With parametric search, the optimal solution for the planar 2 - 
watchtower problem can be obtained in O(n^log^n) time. 

3 Closing Remarks 

In this paper, we present several polynomial time algorithms to solve the pla- 
nar 2-watchtower problems. An immediate open problem is how to solve the 
2-watchtower problem in 3D efficiently. (We remark that for the discrete 2- 
watchtower problem in 3D, where the towers must be located at the vertices of 
the terrain, it is possible to use a necessary condition similar to Lemma 2 to 
obtain an 0(n®) time solution.) 
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Abstract. A “rooted” plane triangulation is a plane triangulation with 
one designated vertex on the outer face. In this paper we give a simple 
algorithm to generate all triconnected rooted plane triangulations with 
at most n vertices. The algorithm uses 0(n) space and generates such 
triangulations in 0(1) time per triangulation without duplications. The 
algorithm does not output entire triangulations but the difference from 
the previous triangulation. By modifying the algorithm we can gener- 
ate all triconnected rooted plane triangulation having exactly n vertices 
including exactly r vertices on the outer face in 0(r) time per triangu- 
lation without duplications, while the previous best algorithm generates 
such triangulations in 0{n^) time per triangulation. Also we can generate 
without duplications all triconnected (non-rooted) plane triangulations 
having exactly n vertices including exactly r vertices on the outer face in 
O(r^n) time per triangulation, and all maximal planar graphs in 0{n^) 
time per graph. 



1 Introduction 

Generating all graphs with some property without duplications has many ap- 
plications, including unbiased statistical analysis [M98]. A lot of algorithms to 
solve these problems are already known [A96, B80, LNOl, M98, W86, etc]. See 
nice textbooks [G93, KS98]. In this paper we wish to generate all triconnected 
“rooted” plane triangulations, which will be defined precisely in Section 2, with 
at most n vertices. It is known that every triconnected planar graph has a unique 
embedding on a sphere only up to mirror copy[HW74j. 

To solve these all-graph-generating problems some types of algorithms are 
known. Glassical method algorithms [G93, p57] first generate all the graphs 
with given property allowing duplications, but output only if the graph has not 
been output yet. Thus this method requires quite a huge space to store a list 
of graphs that have already been output. Furthermore, checking whether each 
graph has already been output requires a lot of time. Orderly method algorithms 
[G93, p57] need not to store the list, since they output a graph only if it is a 
“canonical” representative of each isomorphism class. Reverse search method 
algorithms [A96] also need not to store the list. The idea is to implicitly define 
a connected graph H such that the vertices of H correspond to the graphs with 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 131-141, 2001. 

© Springer- Verlag Berlin Heidelberg 2001 




132 



Shin-ichi Nakano 



the given property, and the edges of H correspond to some relation between the 
graphs. By traversing an implicitly defined spanning tree of H , one can find all 
the vertices of H, which correspond to all the graphs with the given property. 

The main idea of our algorithm is that for some problems we can define a 
tree (not a general graph) as the graph H of reverse search method. Thus our 
algorithm does not need to find a spanning tree of H, since H itself is a tree. 
With some other ideas we give the following two simple but efficient algorithms. 

Our first algorithm generates all triconnected rooted plane triangulations 
with at most n vertices. A rooted plane triangulation means a plane triangu- 
lation with one designated “root” vertex on the outer face. For instance there 
are four triconnected rooted plane triangulations with at most five vertices in- 
cluding exactly three vertices on the outer face, as shown in Fig. 1(a). The root 
vertices are depicted by white circles. However, there are only two triconnected 
plane triangulations with at most five vertices including exactly three vertices 
on the outer face. See Fig. 1(b). The algorithm uses 0{n) space and runs in 
0{f{n)) time, where f{n) is the number of nonisomorphic triconnected rooted 
plane triangulations with at most n vertices. The algorithm generates triangu- 
lations without duplications. So the algorithm generates each triangulation in 
0(1) time on average. The algorithm does not output entire triangulations but 
the difference from the previous triangulation. 

By modifying our first algorithm we can generate without duplications all 
triconnected rooted plane triangulations having exactly n vertices including ex- 
actly r vertices on the outer face. The algorithm uses 0(n) space and runs in 
0{r ■ f{n,r)) time, where f{n,r) is the number of nonisomorphic such triangu- 
lations. So the algorithm generates each triangulation in 0{r) time on average, 
while the previous best algorithm [A96] generates such triangulations in 0{ri?) 
time per triangulation. 




Fig. 1. (a) Triconnected rooted plane triangulations, and (b) triconnected plane 
triangulations. 



Also we can generate all triconnected (non-rooted) plane triangulations hav- 
ing exactly n vertices including exactly r vertices on the outer face in 0{r^n) 
time (on average) per triangulation, and all maximal planar graphs in 0{n^) 
time per graph. Our algorithm is simple and does not need the complicated 
theoretical linear-time plane-graph-isomorphism algorithm in [HW74]. 

For full version of the paper see [NOl]. We also solved similar problems for 
biconnected cases [LNOl]. 
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The rest of the paper is organized as follows. Section 2 gives some definitions. 
Section 3 shows a tree structure among triconnected rooted plane triangulations. 
Section 4 presents our first algorithm. By modifying the algorithm we give three 
more algorithms in Section 5. 



2 Preliminaries 

Let G be a connected graph with n vertices. An edge connecting vertices x 
and y is denoted by (x,y). The degree of a vertex v, denoted by d{v), is the 
number of neighbors of u in G. A cut is a set of vertices whose removal results 
in a disconnected graph or a single- vertex graph Ki. The connectivity k(G) of a 
graph G is the cardinality of the minimum number of vertices consisting a cut. 
G is k— connected if fc < k{G). 

A graph is planar if it can be embedded in the plane so that no two edges 
intersect geometrically except at a vertex to which they are both incident. A 
plane graph is a planar graph with a fixed planar embedding. A plane graph 
divides the plane into connected regions called faces. The unbounded face is 
called the outer face, and other faces are called inner faces. We regard the contour 
of a face as the clockwise cycle formed by the vertices and edges on the boundary 
of the face. We denote the contour of the outer face of plane graph G by Go(G). 
A vertex is an outer vertex of G if it is on Go(G), and an inner vertea; otherwise. 
An edge is an outer edge of G if it is on Go(G), and an inner edge otherwise. A 
plane graph is called a plane triangulation if each inner face has exactly three 
edges on its contour. A rooted plane triangulation is a plane triangulation with 
one designated vertex on Go(G). The designated vertex is called the root vertex. 

Let G be a triconnected plane triangulation. Assume G has a cut S consisting 
exactly three vertices, say x, y, z. Let G be the graph derived from G by removing 
S. Let W be the vertex set of a connected componect of G . Let G be the plane 
subgraph of G induced by S' U W. Since G is triconnected each vertex in S has 
at least one neighbor in W. Since G is a plane triangulation, all vertices in S are 
on the some face, say F, of G . We now have the following lemma. 

Lemma 1. Let P he the path on the contour of F between x and y not containing 
z. Then the followings hold, (a) If x and y are inner vertices of G, then G has 
an (inner) edge (x,y). (b) If x is an inner vertex and y is an outer vertex of G, 
then G has an (inner) edge (x,y). (c) If x and y are outer vertices of G, then 
all vertices on P are also outer vertices. 

Thus if G yf AT 4 , then at most two vertices of S are outer vertices, because by 
(c) above if all three vertices in S are outer vertices, then S cannot be a cut. If S 
has at most one outer vertex, then a cut S = {x, y, z} of G is called a separating 
triangle, and by (a) and (b) above G has three edges (x,y), (y,z) and (z,x). If 
S has exactly two outer vertex, say x and y, and exactly one inner vertex, say 
z, then a cut S = {x, y, z} of G is called a separating fan with center z, and by 
(b) above G has two edges {x, z), {y, z). 
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Then we define an operation on a plane triangulation. Assume {u, v) is an 
inner edge of a plane triangulation G. Let u, v, x and v, u, y be the two faces 
having (u,v) on its contour. By (i) deleting the edge (u,v), (ii) merging u and 
V into a new vertex w, and (iii) replacing edges (u,x) and (v,x) by (w,x), and 
edges (u, y) and (w, y) by {w, y), as shown in Fig. 2 (a), we can have a new plane 
graph. This operation is called a contraction of inner edge (u,v). 

Similarly, for an outer edge (u,v) of a plane triangulation G, let u,v,x be 
the face having (w,u) on its contour. By (i) deleting the edge (u,v), (ii) merging 
u and V into a new vertex w, and (iii) replacing edges (u, x) and (v, x) by (w, x), 
we can have a new plane graph. This operation is called a contraction of outer 
edge (m, v). 

Let G be a triconnected plane triangulation. If G has a separating triangle 
{u,v,z}, then the contraction of (u,v) produces “parallel” edge {w,z), so the 
resulting plane graph is not a plane triangulation anymore. If G has a separating 
fan {u, V, z} with center u, then after contracting {u, v), the resulting plane graph 
has a cut {w, z}, and is not triconnected anymore. If G has no cut consisting of 
u, V and exactly one more vertex, then the contraction of (u, v) produces again 
a triconnected plane triangulation. We say such edge (u, v) is contractible. 




Fig. 2. (a) A contraction and (b) a contraction sequence. 



3 The Contracting Sequence and the Genealogical Tree 

Let Sn be the set of all triconnected rooted plane triangulations with at most n 
vertices. Here we explain a tree structure among the triangulations in Sn- 

Let G be a triangulation in Sn but not K^. Let Go(G) = vi,V2, ■ ■ ■ ,Vk, and 
vi be the root vertex of G. A contractible edge (vi,Wi) is the first contractible 
edge of G if none of (ui, W2), • • • , (vi,Wi-i) is contractible edge, where 

(vi,Vk) = (uijWi), (ui, W2), • • • , (ui, Wd(„j)) = (vi,V2) are the edges incident to 
Vi, and assume that they appear around Vi counterclockwise in this order. 

Lemma 2. Every trieonnected rooted plane triangulation except K 4 has the first 
contractible edge. 

For each triconnected rooted plane triangulation G in Sn except K 4 , contracting 
the first contractible edge from G results in also a triconnected rooted plane 
triangulation, denoted by T’(G), in with one less vertices. Thus we can define 
the triangulation P{G) in S'„ for each G in S'„ except K4. We say G is a child 
triangulation of P{G). 
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(3,3) 

Fig. 3. Genealogical tree Tg. 



Given a triangulation G in Sn, by repeatedly contracting the first contractible 
edge, we can have the unique sequence G, P{G),P{P{G)),- ■ ■ of triangulations 
in Sn which eventually ends with K 4 . See an example in Fig. 2(b), in which the 
first contractible edges are depicted by thick lines. 

By merging those sequences we can have the genealogical tree Tn of 5'„ such 
that the vertices of T„ correspond to the triangulations in 5'„, and each edge 
corresponds to each relation between some G and P{G). For instance Tq is 
shown in Fig. 3, in which the first contractible edges are depicted by thick lines. 
We call the vertex in T„ corresponding to K 4 the root of T„. 

4 Algorithm 

Given S'„ we can construct Tn by the definition, possibly with huge space and 
much running time. However, how can we construct T„ efficiently only given an 
integer n? Our idea is by reversing the contracting procedure as follows. 

Given a triconnected rooted plane triangulation G in S'„ with at most n — 1 
vertices, we wish to find all child triangulations of G. Let Go{G) = 1 ) 1 , U 2 , • • • , Ufe, 
and vi be the root vertex of G. Let (wi, Ufc) = (wi, rui), (ui, W 2 ), • • • , (ui, = 

(vi,V 2 ) be the edges incident to vi, and assume that they appear around vi 
counterclockwise in this order. Assume (vi,Ws) be the first contractible edge of 
G. Since especially only K 4 has no contractible edge, for convenience, we regard 
(vi,Ws) = (vi,V 2 ) as the first contractible edge for K 4 . 

We denote by Gi{i,j), where I < i < j < d{vi), the rooted plane triangula- 
tion obtained from G by (1) removing edges (ui, iCi+i), (fi, rUi+ 2 ), • • • , (ui, Wj_i) 
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(thus if i + 1 = j we need not remove any edge), (2) adding new vertex v on 
the inner face of G containing v\,Wi,Wj, (3) adding an edge (wi,u) in the face, 
and (4) adding j — i+1 > 2 edges {v,Wi),{v,Wi+i), - ■ ■ ,{v,Wj), as shown in 
Fig. 4(a). Gi{i,j), where I < i < j < d{vi), is a child triangulation of G if and 
only if (wi,u) is the first contractible edge of Gi{i,j)- (This corresponds to a 
contraction of an inner edge. See Fig. 3.) 

Also, we denote by Go{i,j), where i = d{vi) and 3 < j < d{vi), the rooted 
plane triangulation obtained from G by (1) removing edges (vi,Wj), (vi,Wj+i), 
■■■, (vi,Wd(vi)), (2) adding new vertex v on the outer face of G, (3) adding 
edges (vi,v) and (v,V 2 ), and (4) adding edges (v,Wj-i), (v,Wj), ■ ■ ■ , (v,Wd(vi)), 
as shown in Fig. 4(b). Go{i,j), where i = d{vi) and 3 < j < d{vi), is a child 
triangulation of G if and only if (ui,v) is the first contractible edge of Go{i,j)- 
(This corresponds to a contraction of an outer edge. See Fig. 3.) 





Fig. 4. Illustration for (a) Gi{i,j) and (b) Go{i,j)- 



Thus Gi{i,j) is a child triangulation of G having an inner edge as the first 
contractible edge, while Go(i,j) is a child triangulation of G having an outer 
edge as the first contractible edge. And each child triangulation is either Gi{i,j) 
or Go{i,j) for some i and j. 

For each t, 2 < t < d{vi) — 1, we denote by q{t) the largest index such that 
{wt,Wg(^f-j) is an edge (chord) of G if such edge exists, and q{t) = t otherwise. 
Now we check whether each Gi{i,j) is a child triangulation of G, and check also 
for each Go{i,j)^ as follows. Let (vi,Ws) be the first contractible edge of G. 
Case 1: Gi{i,j), where 1 < i < j < d{vi). We have four subcases. 

Case 1(a): J<s. 

In this case (ui,u) is the first contractible edge of Gi{i,j), and P{Gi{i,j)) = G. 

Case 1(b): i < s < j. 

If G has a separating fan containing vi and Wi, then (ui, v) is the first contractible 
edge for each j, s < j < d{vi) and P{Gi{i,j)) = G. Otherwise, if j < q{i) then 
(wi,v) is the first contractible edge and P{Gi{i,j)) = G. Otherwise (vi,Wi) is a 
contractible edge of G{i,j), and so P{Gi{i,j)) ^ G. 

Case 1(c): i = s. 

If j = i+1, then (ui, u) is the first contractible edge of Gi{i, j) (since vi,Wg, Wg+i 
become a separating triangle) and P{Gi{i,j)) = G. Otherwise, j > i + 2 holds 
then (ui,Ws) is still contractible in Gi{i,j), and P{Gi{i,j)) yf G. 

Case 1(d): i > s. 

(vi,v) is not the first contractible edge of Gi{i,j). Thus P{Gi{i,j)) yf G. 
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Case 2: Go{i,j), where i = d{vi) and 3 < j < d{vi). We have three subcases. 
Case 2(a): s = d{vi). 

(Now each W 2 ,W 3 , ■ ■ ■ ,Wd(vi)-i is contained in some separating fan.) For each 
Go{iij), 3 < j < d{vi), (vi,v) is the first contractible edge, and F(Go(i,j)) = G. 
Case 2(b): s = d{vi) — 1. 

For each G{i,j), 3 < j < d{v\), (vi,v) is the first contractible edge, and 
P{Go{i, j)) = G. Note that in Go{i,d{vi)), vi,Wd(vi)~i,V 2 become a separat- 
ing fan, and other cases are trivial. 

Case 2(c): s < d(vi) — 1. 

If j — 1 > s then (ui,rcs) is still contractible in Go{i,j), thus (viju) is not the 
first contractible edge and P{Go{i,j)) G. Otherwise if G has no separating fan 
containing vi and wj-i then (vi,Wj-i) is contractible in Go{i,j), thus (vi,v) 
is not the first contractible edge, and P{Go{i,j)) ^ G. Otherwise, for each j, 
3 < j < s, (vi,v) is the first contractible edge in Go{i,j)i ^nd P{Go{i,j)) = G. 

Based on the case analysis above we can find all child triangulations of given 
triangulation in S'„. If G has k child triangulations, then we can find them in 
0{k) time. This is an intuitive reason why our algorithm generates triangulations 
in 0(1) time per triangulation. 

And recursively repeating this process from the root of T„ corresponding to 
K 4 we can traverse without constructing whole T„. During the traversal of T„, 
we assign a label (i,j) to each edge connecting G and either Gi{i,j) or Go{i,j) 
T„, as shown in Fig. 3. Each label denotes how to generate a child triangulation 
of G, and each sequence of labels on a path starting from the root specifies a 
triangulation in S'„. For instance (1, 2), (1, 2) specify the uppermost triangulation 
in Fig. 3. During our algorithm we will maintain these labels only on the path 
from the root to the “current” vertex, because those are enough information to 
generate the “current” triangulation. To generate next triangulation, we need to 
maintain some more information (the first contractible edge (■Ui,Ws), and Wg(t) 
for each 1 < t < s, etc.) only for the triangulations on the “current” path, 
which has length at most n. This is an intuitive reason why our algorithm uses 
only 0(n) space, while the number of triangulations may not be bounded by a 
polynomial in n. 

Procedure find-all-child-triangulations(G) 

begin 

1 output G { Output the difference from the previous triangulation} 

2 if G has exactly n vertices then return 

3 for i = 1 to s — 1 

4 for J = z -I- 1 to s 

5 flnd-all-child-triangulations(Gi(z, j)) { Case 1(a)} 

6 for z = 1 to s — 1 

7 if G has a separating fan containing v\ and Wi then 

8 for J = s -I- 1 to d{vi) 

9 find-all-child-triangulations(Gi(z, j)) { Case 1(b)} 

10 else then 
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11 for j = s + 1 to q{i) 

1 2 find-all-child-tr iangulations (Gi (i,j)) 

13 find-all-child-triangulations(Gi(s, s + 1)) 

14 if s = d{vi) or d{vi) — 1 then 

15 for j = 3 to d{vi) 

16 find-all-child-triangulations(Go((i(ui), j)) 

17 else then 

18 for j = 3 to s 

19 if G has a separating fan containing vi and wj 

20 find-all-child-triangulations(Go(d(wi), j)) 
end 

Algorithm find-all-triangulations(n) 
begin 

1 output K 4 

2 if n = 4 then return 

3 G = A 4 

4 find-all-child-triangulations(Gi(l, 2)) 

5 find-all-child-triangulations(Gi(l, 3)) 

6 find-all-child-triangulations(Gi(2, 3)) 

7 find-all-child-triangulations(Go(3, 3)) 
end 

Theorem 1. The algorithm uses 0{n) space and runs in 0{f{n)) time, where 
f{n) is the number of nonisomorphic triconnected rooted plane triangulations 
with at most n vertices. 

5 Modification of the Algorithm 

Then we consider our second problem. Let STf be the set of triconnected rooted 
plane triangulations having exactly n vertices including exactly r outer vertices. 
We wish to generate all triangulations in STf without duplications. Clearly all 
such triangulations are in but with other triangulations. How can we output 
only triangulations in furthermore efficiently? Our idea is by pruning 
in the following way. 

For each triangulation G in S'„ but K 4 , if the first contractible edge is an 
inner edge then P(G) is a triangulation in 5'„ having the same number of outer 
vertices and one less inner vertices, otherwise the first contractible edge is an 
outer edge and P(G) is a triangulation in Sn having one less outer vertices 
and the same number of inner vertices. Thus if G in has exactly n vertices 

including exactly r outer vertices, then every “descendant” triangulation of G 
in T„ can have at most r + (n — n ) outer vertices. Thus ifr>r + (n — n ) 
then none of descendant triangulations of G is in STf . Also if G has r > r outer 
vertices then every descendant triangulation of G has r or more outer vertices, 
and so never in STf. We prune such triangulations from T„. 



{ Case 1(b)} 
{ Case 1(c)} 



{ Case 2(a) and (b)} 



_i then 

{ Case 2(c)} 
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We prune further based on the following lemma. We need some definitions 
here. Let 5*^ be the set of triconnected rooted plane triangulations with n < 
n vertices including exactly r = r outer vertices (so it already has enough 
outer vertices), and S° be the set of triconnected rooted plane triangulations 
with n < n vertices including exactly r = r — {n — n) outer vertices (so 
it already has enough inner vertices). Let S')) be the set of triconnected rooted 
plane triangulations with n < n vertices including r , r — (n — n )< r < r outer 
vertices. Let G be a triangulation in S„, Go(G) = vi,V 2 , ■ ■ ■ ,Vk, and vi be the 
root vertex of G, (vi,Vk) = (ui, wi), (wi, W 2 ), • • • , (wi, Wd(«i)) = (vi,V 2 ) be the 
edges incident to wi, and assume that they appear around v\ counterclockwise 
in this order. We call W 2 the pivot of G. If G has a separating fan containing 
both the root and the pivot of G, then it is called the rp-fan of G. 

Lemma 3. Let G be a triangulation in Sn with n < n vertices including r 
outer vertices. 

(a) IfG has the rp-fan, then (al) ifG has a child triangulation Gi(l, j) for some 
j, then Gi{l,j) has no rp-fan, (a2) if G has a child triangulation Gi(2,j) 
for some j, then Gi(2,j) has the rp-fan, (aS) ifG has a child triangulation 
Gi{i,j) for some i > 3 and j, then Gi{i,j) has the rp-fan, and (a4) if G has 
a child triangulation Go{i,j) for some i and j, then Go{i,j) has the rp-fan. 

(b) IfG has no rp-fan, then (bl ) ifG has a child triangulation Gi{l,j) for some 
j, then Gi{l,j) has no rp-fan, (b2) G has no child triangulation Gi{i,j) for 
any i > 2 and j, and (b3) G has no child triangulation Go{i,j) for any i 
and j. 

Lemma 4. Let G be a triangulation in S„ with n < n vertices including r,r — 
{n — n) < r < r outer vertices. If G has no rp-fan then G has no descendant 
triangulation in Szf. IfG has the rp-fan then G has at least one descendant 
triangulation in Szf. 

Proof. By repeatedly applying Lemma 3(b), if G has no rp-fan then each descen- 
dant triangulation of G has no rp-fan and having exactly r < r outer vertices. 
Thus none of them are in ST)). While if G has the rp-fan then the first con- 
tractible edge is not (ui,'u; 2 ) and then each Go(d(ui), j) is a child triangulation 
of G if and only if, for each w = W 2 ,ws, ■ ■ ■ G has a separating fan con- 
taining w and vi. Such child triangulation is always exist for j = 3. □ 

Now we define a new tree, called the genealogical tree Tf. We say a vertex 
u in a tree with root r has depth d if the path connecting v and r on the tree 
has exactly d edges. Define Tf as the tree such that (i) the vertices of Tf having 
depth n — 4 correspond to the triangulations in ST^, (ii) the vertices of Tf having 
depth at most n — 5 correspond to the triangulations in S)) U S)j U S)), and (iii) 
each edge corresponds to each relation between some G and P{G). The vertex 
in Tf corresponding to K 4 is called the root of Tf. 

Given a triangulation G in U S')^ U S')), we can find all child triangulations 
of G efficiently, by a similar case analysis to the one in the previous section. If 
G has k child triangulations, then again we can find them in 0(fc) time. 
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Procedure find-all-child-triangulations2(C?) 
begin 

1 Assume G has n vertices including r outer vertices. 

2 if G has exactly n vertices then output G return 

3 if r < r and r + {n — n) > r 

4 then { We need more both inner and outer vertices.} 

5 for each Gi{i,j) with i>2 satisfying (a2) or (a3) 

6 find-all-child-triangulat ions2 ( Gi{i,j)) 

7 for each Go (i,j) satisfying (a4) 

8 find-all-child-triangulations2(Go(i , j)) 

9 else if r < r and r + (n — n ) = r 

10 then { We need to increase only outer vertices. } 

11 for each Go(i,j) satisfying (a4) 

12 find-all-child-triangulations2(Go(i , j)) 

13 else r = r holds 

14 then { We already have r outer vertices.} 

15 { We only need to increase only inner vertices.} 

16 for each Gi{i,j) satisfying (al), (a2), (a3) or (bl) 

1 7 find-all-child-triangulat ions2 (Gi (i , j ) ) 
end 

Lemma 5. Let f{n,r) be the number of triangulations in Szf. Then Tf has at 
most r ■ f{n, r) vertices. 

Proof. (Sketch) Each vertex of Tf corresponds to a triangulation in S')) U 5')^ U 
S° U Each vertex with depth n — 4 in corresponds to a triangulation in 
5'=)). Each “ancestor” triangulation G of a triangulation G G S^” has two or 
more child triangulation if G G S” U S)j (since Gi(l,2) and Gi(l,3) are always 
child triangulation of G), and has one or more child triangulation if G G S° and 
G has the rp-fan (since Gi{d{vi),3) is always child triangulation of G). And each 
G G S^” has at most r — 4 “ancestor triangulations which are in G G S°. Thus 
Tf has at most r • f{n,r) vertices. □ 

Since the algorithm needs only a constant time of computation for each edge of 
T”, we have the following lemma. 

Lemma 6. The algorithm uses 0{n) space and runs in 0{r-f{n,r)) time, where 
f{n, r) is the number of nonisomorphic triconnected rooted plane triangulations 
having exactly n vertices including exactly r vertices on the outer face. 

We modify our second algorithm further so that it output all triconnected (non- 
rooted) plane triangulations having exactly n vertices including exactly r vertices 
on the outer face, as follows. 

At each leaf v of the genealogical tree Tf, the triangulation G corresponding 
to v is checked whether the contracting sequence of G with the root vertex is the 
lexicographically first one among the r contracting sequences of G for r choices 
of the root vertex on Go(G), and only if so G is output. Thus we can output 
only canonical representative of each isomorphism class. 




Efficient Generation of Triconnected Plane Triangulations 



141 



Lemma 7. The algorithm uses 0(ji) space and runs in 0{r‘^n ■ g(ji,r)) time, 
where g{n,r) is the number of nonisomorphic triconnected (non-rooted) plane 
triangulations having exactly n vertices including exactly r vertices on the outer 
face. 

A triconnected (non-rooted) plane triangulations G is maximal if G has ex- 
actly 3 vertices on Go{G). For every maximal planar graph by choosing the outer 
face and the root vertex, there are exactly 2m = 3n — 6 (triconnected) rooted 
plane triangulation G with exactly 3 vertices on the outer face, where m is the 
number of edges of G. We modify the algorithm further as follows. At each leaf 
v of the genealogical tree T^, the triangulation G corresponding to v is checked 
whether the contracting sequence of G with the rooted vertex is the lexicograph- 
ically first one among the 2m contracting sequences of G (3 choice of the root 
vertex on Go{G) for each of 2m/3 choice of the outer face of G), and only if so 
G is output. 

Theorem 2. The modified algorithm generate all maximal planar graphs in 
0{n^ ■ h{n)) time, where h{n) is the number of nonisomorphic maximal planar 
graphs with exactly n vertices. The algorithm uses 0{n) space. 
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Abstract. We consider the following problem. Given a polygon P, pos- 
sibly with holes, and having n vertices, compute a pair of equal radius 
disks that do not intersect each other, are contained in P, and whose 
radius is maximized. Our main result is a simple randomized algorithm 
whose expected running time, on worst case input, is 0(n log n). This is 
optimal in the algebraic decision tree model of computation. 



1 Introduction 

Let P be a polygon, possibly with holes, and having n vertices. We consider the 
following problem, which we call 2-disk: Find a pair of disks with radius r* that 
do not intersect each other, are contained in P, and such that r* is maximized. 
Biedl et at [5] gave an 0{n?) time algorithm to solve this problem. 

Special cases of 2-disk have been studied previously. When P is a convex 
polygon, Bose et al. [6] describe a linear time algorithm and Kim and Shin 
[10] describe an O(nlogn) time algorithm. For simple polygons (i.e. polygons 
without holes), Bespamyatnikh [4] gives an O(nlog^n) time algorithm based on 
the parametric search paradigm [11]. 

Another special case occurs when the holes of P degenerate to points. This is 
known as the maximin 2-site facility location problem [3,9]. In this formulation 
we can think of the centers of the two disks as obnoxious facilities such as 
smokestacks, or nuclear power plants, and the points as population centers. The 
goal is maximize the distance between each facility and the nearest population 
center. Katz et al. [9] give an O(nlogn) time algorithm for the decision version 
of the 2-site facility location problem in which one is given a distance d and 
asked if there exists a placement of 2 non-intersecting disks of radius d, each 
contained in P such that no point is included in either of the disks. 

In this paper we present a simple randomized algorithm for the general case in 
which P is not necessarily convex and may contain holes. Our algorithm runs in 
O(nlogn) expected time. It can also be used to solve the optimization version 
of the 2-site max;imin facility location problem in 0(n log n) time. Finally we 
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observe that, when we allow polygons with holes, l7(nlogn) is a lower bound 
for 2-disk by a simple reduction from max-gap. 

The remainder of the paper is organized as follows: Section 2 reviews defi- 
nitions and previous results regarding the medial-axis. Section 3 describes our 
algorithm. Section 4 summarizes and concludes with an open problem. 



2 The Medial- Axis 

For the remainder of this paper, P will be a polygon, possibly with holes, and 
having n vertices. The medial-axis M{P) of P is the locus of all points p for 
which there exists a disk centered at p, contained in P, and which intersects 
the boundary of P in two or more points. See Fig. 1 for an example. Alterna- 
tively, M(P) is a portion of the Voronoi diagram of the open line segments and 
vertices defined by the edges of P. To be more precise, we need to remove the 
Voronoi edges that are outside P and those associated with an edge and one of 
its endpoints. It is well known that the medial-axis consists of 0{n) straight line 
segments and parabolic arcs. 




Fig. 1. The medial-axis of a polygon with a triangular hole. 



Algorithmically, the medial-axis is well understood. There exists an 0{n) 
time algorithm [7] for computing the medial-axis of a polygon without holes 
and O(nlogn) time algorithms for computing the medial-axis of a polygon with 
holes [2]. Furthermore, these algorithms can compute a representation in which 
each segment or arc is represented as a segment or arc in where the third 
dimension gives the radius of the disk that touches two or more points on the 
boundary of P. 

We say that a point p G P supports a disk of radius r if the disk of radius r 
centered at p is contained in P. We call a vertex, parabolic arc or line segment 
X of M{P) an elementary object if the radius of the largest disk supported by 
p G X is monotone as p moves from one endpoint of x to the other. Each edge 
of M{P) can be split into two elementary objects. Thus, M{P) can be split into 
0(n) elementary objects whose union is M{P). 
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3 The Algorithm 

Next we describe a randomized algorithm for 2-disk with O(nlogn) expected 
running time. We begin by restating 2-disk as a problem of computing the 
diameter of a set of elementary objects under a rather unusual distance function. 
We then use an algorithm based on the work of Clarkson and Shor [8] to solve 
this problem in the stated time. 

The following lemma, of which similar versions appear in Bose et al. [6] and 
Biedl et al. [5], tells us that we can restrict our search to disks whose centers lie 
on M{P). 

Lemma 1 . Let D\ and D2 be a solution to 2-disk which maximizes the distance 
between D\ and D2 and let pi and p2 be the centers of Di and D2, respectively. 
Then Di and D2 each intersect the boundary of P in at least two points and 
hence Pi and P2 are points of M{P). 

Proof. Refer to Fig. 2. Suppose that one of the disks, say I?i, intersects the 
boundary of P in at most one point. Let o\ be this point, or if D\ does not 
intersect the boundary of P at all then let o\ be any point on the boundary of 
Di. Note that there is some value of e > 0 such that Di is free to move by a 
distance of e in either of the two directions perpendicular to the direction piol 
while keeping Di in the interior of P. However, movement in at least one of 
these directions will increase the distance \piP 2 \, which is a contradiction since 
this distance was chosen to be maximal over all possible solutions to 2-disk. 




Fig. 2 . The proof of Lemma 1 



Let Xi and X 2 be two elementary objects of M{P). We define the distance 
between xi and X 2 , denoted d(xi,X 2 ) as 2r, where r is the radius of the largest 
pair of equal-radius non-intersecting disks di and ^ 2 , contained in P and with di 
centered on Xi, for i = 1,2. There are two points to note about this definition of 
distance: (1) if the distance between two elementary objects is 2r, then we can 
place two non-intersecting disks of radius r in P, and (2) the distance from an 
elementary object to itself is not necessarily 0. Given two elementary objects it 
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is possible, in constant time, to compute the distance between them as well as 
the locations of 2 disks that produce this distance [5]. 

Let E be the set of elementary objects obtained by taking the union of the 
following three sets of elementary objects: 

1. the set of vertices of M{P), 

2. the set of elementary line segments obtained by splitting each straight line 
segment of M{P) into at most two elementary objects. 

3. the set of elementary parabolic arcs obtained by splitting each parabolic arc 
of M{P) into at most two elementary objects. 

We call the diameter of E the maximum distance between any pair x,y G E, 
where distance is defined as above. Now, it should be clear from Lemma 1 that 
2-disk can be solved by finding a pair of elements in E whose distance is equal 
to the diameter of E.^ 

Thus, all that remains is to devise an algorithm for finding the diameter of E. 
Let m denote the cardinality of E and note that, initially, m = 0{n). Motivated 
by Clarkson and Shor [8] , we compute the diameter using the following algorithm. 
We begin by selecting a random element x from E and finding the element 
x' G E whose distance from x is maximal, along with the corresponding radius 
r. This can be done in 0{m) time, since each distance computation between two 
elementary objects can be done in constant time. Note that r is a lower bound 
on r* . We use this lower bound to do trimming and pruning on the elements of 
E. 

We trim each element y G E hy partitioning y into two subarcs, ^ each of 
which may be empty. The subarc y> is the part of y supporting disks of radius 
greater than or equal to r. The subarc is the remainder of y. We then trim 
from y by removing y from E and replacing it with y> . During the trimming 
step we also remove from E any element that does not support a disk of radius 
greater than r. Each such trimming operation can be done in constant time, 
resulting in an 0(m) running time for this step. 

Next, we prune E. For any arc y G E, the lowest point of y is its closest point 
to the boundary of P. In the case of ties, we take a point which is closest to one 
of the endpoints of P. By the definition of elementary objects, the lowest point 
of y is therefore an endpoint of y. The closed disk with radius r centered on the 
lowest point of y is denoted by D{y). We discard all the elements y G E such 
that D{y) fl D{x) yf 0 for all x G E. 

Pruning can be performed in O(mlogTO) time by computing, for each lowest 
endpoint p, a matching lowest endpoint q whose distance from p is maximal and 
then discarding p if \pq\ < 2r. This computation is known as all-pairs furthest 
neighbors and can be completed in 0(m log m) time [1]. 

Once all trimming and pruning is done, we have a new set of elementary 
objects E' on which we recurse. The recursion completes when \E'\ < 2, at 

^ Here we use the term “pair” loosely, since the diameter may be defined by the 
distance between an elementary object and itself. 

^ We use the term subarc to mean both parts of segments and parts of parabolic arcs. 
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which point we compute the diameter of E' in constant time using a brute- 
force algorithm. We output the largest pair of equal-radius non-overlapping disks 
found during any iteration of the algorithm. 

To prove that this algorithm is correct we consider a pair of non-intersecting 
disks Di and D2, each contained in P and having radius r*, centered at pi 
and p2, respectively, such that the Euclidean distance \piP2\ is maximal. The 
following lemma shows that pi and p2 are not discarded from consideration 
until an equally good solution is found. 

Lemma 2. If, during the execution of one round, {pi,P2} C UE and r < r* , 
then {pi,P2} C UE' at the end of the round. 

Proof. We need to show that at the end of the round, there exists elementary 
objects y 1,1/2 G E' such that pi G yi and P2 G 2/2- More specifically, we need to 
show there exists yi,y2 G E such that pi , respectively p2 is not trimmed from 
2/1, respectively 2/2, and 2/1 and 2/2 are not pruned. 

To see that pi and p2 are not trimmed from any elementary object that 
contains them we simply note that pi and p2 both support disks of radius r* > r 
and are therefore not trimmed. 

To prove that 2/1 and 2/2 are not pruned we subdivide the plane into two open 
halfspaces Hi and H2 such that all points in Hi are closer to pi than to p2 and 
vice-versa. We denote by L the line separating these two halfspaces. 

Recall that, after trimming, an elementary object x is only pruned if D{x) fl 
D{y) yf 0 for all y £ E. We will show that D{yi) C Hi and D{y2) C H2, 
therefore D{yi) fl D{y2) = 0 and neither 2/1 nor 2/2 are pruned. It suffices to 
prove that D{yi) C Hi since a symmetric argument shows that D{y2) C H2. We 
consider three separate cases depending on the location of pi on M{P). 

Case 1: pi is a vertex of M{P). In this case we choose 2/1 to be the singleton 
elementary object {pi}. Thus, D{yi) is centered at pi. Furthermore, the distance 
between pi and L is at least r* > r. Therefore, one point of D{yi) is contained 
in Hi and D{yi) does not intersect the boundary of Hi so it must be that 
D{yi) C Hi. 

Case 2: pi lies in the interior of a straight line segment of M{P). Let p'l be 
the lower endpoint of 2/1- Let 6 be the angle {p2Pi,PiPi) (see Fig. 3). If 0 G 
[— 7r/2,7r/2] then we can move pi slightly in the direction opposite to p'l while 
keeping Di inside P, thus contradicting the assumption that \p1P2 \ is maximal. 
Therefore 9 G [tt/ 2, 37 t/ 2], which implies that D{yi) lies in Hi. 

Case 3: pi lies in the interior of a parabolic arc of M{P). In this case Di is 
tangent to an edge Ci of P and touches one of its vertices v. p'l still denotes the 
lower endpoint of 2/1- Without loss of generality, assume that Ci is parallel to 
the a;-ax;is and x{p'i) < x{pi) (see Fig. 4). Let L' be the line parallel to L that 
crosses the segment [pi,P2] and that is tangent to Di. We denote by oi the point 
where ci is tangent to Di, and we denote by o( the point such that (oi,o^) is a 
diameter of Di. Finally, the convex hull of Ei and D{yi) is denoted by C. 

It must be that x(p2) > x(pi), otherwise pi and Di could be moved in the 
positive X direction while keeping Di in P. This would increase the distance 
\piP2\ which is defined as maximal. It follows that L' is tangent to Di along the 
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Fig. 3. The proof of Lemma 2 Case 2 




Fig. 4. The proof of Lemma 2 Case 3. 



counterclockwise arc (o^, oi). Then L' is tangent to C, so by convexity C lies on 
the same side of L' as p\ which implies that Di is contained in Hi . 

Let di denote the distance of the furthest element in E from Xi, and suppose 
for the sake of analysis that the elements of E are labeled xi,. . . ,Xn so that 
di < di+i- The following lemma helps to establish the running time of the 
algorithm. 

Lemma 3. If we select x = Xi as the random element, then we discard all 
Xj € E such that j < i from E. 
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Proof. For any j < i, either xj does not support a disk of radius greater than 
di, or every point on Xj that supports a disk of radius di is of distance at most 
di from any other point of M{P) that supports a disk of radius di. 

In the first case, Xj is removed from E by trimming. In the second case, 
D{xj) n D{xk) yf 0 for all Xk & E and Xj is removed by pruning. 

Finally, we state and prove our main theorem. 

Theorem 1. The above algorithm solves 2-disk in O(nlogn) expected time. 

Proof. The algorithm is correct because, by Lemma 2, it never discards p± nor 
P 2 until it has found a solution with r = r*, at which point it has already found 
an optimal solution that will be reported when the algorithm terminates. 

To prove the running time of the algorithm, we use the following facts. Each 
round of the algorithm can be completed in 0{mlogm) time where m is the 
cardinality of E at the beginning of the round. By Lemma 3, when we select xt 
as our random element, all elements Xj with j < i disappear from E. Therefore, 
the expected running time of the algorithm is given by the recurrence 

^ m 

Tim) < — r(m — i) + Olmlogm) , 
m 

i=l 

which readily solves to O(mlogm). Since m € 0(n), this completes the proof. 

4 Conclusions 

We have given a randomized algorithm for 2 -disk that runs in 0(n log n) ex- 
pected time. The algorithm is considerably simpler than the O(nlog^n) algo- 
rithm of Bespamyathnikh [4] and has the additional advantage of solving the 
more general problem of polygons with holes. Although we have described our 
algorithm as performing computations with distances, these can be replaced with 
squared distances to yield an algorithm that uses only algebraic computations. 

In the algebraic decision tree model of computation, one can also prove an 
f?(n log n ) lower bound on any algorithm for 2-disk through a reduction from 
MAX-GAP [12]. Suppose that the input to MAX-GAP is yi, . . . Without loss 
of generality one can assume that yi = min{j/i : 1 < i < n} and y„ = max{j/i : 
1 < i < n }. We then construct a rectangle with top and bottom sides at yi and 
y„, respectively, and with width 2(t/„ — j/i). The interior of this rectangle is then 
partitioned into rectangles with horizontal line segments having y coordinates 
yi, . . . , j/„. See Fig. 5 for an example. 

It should then be clear that the solution to 2 -disk for this problem corre- 
sponds to placing two disks in the rectangle corresponding to the gap between yi 
and yi+i which is maximal, i.e., it gives a solution to the original max-gap prob- 
lem. Since this reduction can be easily accomplished in linear time and max-gap 
has an l7(nlogn) lower bound, this yields an l7(nlogn) lower bound on 2 -disk. 

The above reduction only works because we allow polygons with holes. An 
interesting open problem is that of determining the complexity of 2-disk when 
restricted to simple polygons. Is there a linear time algorithm? 
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Fig. 5. Reducing MAX-GAP to 2 -disk. 
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Abstract. In this paper, we consider the problem of computing a max- 
imum cardinality matching among a set 7 of n intervals that are col- 
ored as either red or blue, such that a pair of intervals in I can be 
matched only if they overlap with each other and have different colors. 
This problem arises in some applications such as radiosurgery treatment 
planning. We present a greedy algorithm for this problem that runs in 
0(n log log n) time for sorted input. We also solve a useful generalization 
of this red/blue interval matching problem in the same time bound. 



1 Introduction 

Consider a set of n input intervals, I = {Ii, I 2 , ■ ■ ■ , In}, on the x-axis that are 
colored as either red or blue, where each interval Ii = [le{i), re(i)] is specified by 
its left endpoint le{i) and right endpoint re{i), with le(i) < re{i). Two intervals 
Ii = [le{i),re{i)] and Ij = [le{j),re{j)] are disjoint to each other if re{i) < le{j) 
or re{j) < le{i); otherwise, they overlap. A graph G is called an interval graph if 
there exists a set Ig of intervals such that there is a one-to-one correspondence 
between the vertices of G and the intervals in Iq and such that any two vertices 
in G are connected by an edge if and only if their corresponding intervals in Ig 
overlap. Such an interval set Ig is called an interval model of G. An interval 
graph G is said to be proper if and only if there is an interval model Ig of G 
such that no interval in Ig is contained within any other interval in Ig ■ Interval 
graphs find applications in many areas, such as VLSI design, scheduling, biology, 
medicine, traffic control, and archeology [5]. In this paper, we assume that an 
interval model of the corresponding interval graph is already given (as this is 
often the case in applications). We will refer to interval 7j, interval i, interval 
[le{i),re{i)], and vertex i (corresponding to interval i) interchangeably. 

A matching in a graph G is a subset M of the edges of G such that no two 
distinct edges in M are incident to the same vertex. The problem of computing 
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maximum matchings in graphs has many applications and has received a lot of 
attention [4]. 

In this paper, we consider the following matching problem on a set / of n 
red/blue intervals: Find a maximum cardinality matching M in I such that two 
intervals can be matched in M only if they overlap and their colors are different. 
This problem, in fact, is that of computing a maximum cardinality matching in 
the interval graph G of / that is a bipartite graph [4] . 

The best known maximum bipartite matching algorithm on a general n- 
vertex graph is due to Hopcroft and Karp [6], which takes 0(n^'®) time. To 
our best knowledge, no algorithm for the specific matching problem on bipartite 
interval graphs was previously known. There are only a few known results for 
matching problems on interval graphs and some related graphs. In particular, 
Moitra and Johnson [7] gave an 0(n log n) time algorithm for maximum match- 
ing in ordinary (non-bipartite) interval graphs. For maximum matching in the 
complement graph of an ordinary interval graph, Andrews and Lee [2] designed 
an 0(n log n) time algorithm, and Andrews et al. [1] provided an 0(n) time 
algorithm for sorted input. 

The maximum red/blue interval matching problem that we consider arises 
in the study of radiosurgery treatment planning. Radiosurgery is a non-invasive 
technique for eradicating localized benign and malignant tumors by using ra- 
diation beams. The shape of a radiation beam is formed by a device called 
collimator [12]. The distribution of radiation for a treatment is often prescribed 
by a so-called intensity map of radiation. An intensity map for a multileaf col- 
limator based radiosurgery system [12] may consist of multiple tracks (called 
arcs). A pair of collimator leaves is assigned to each track. On every track, the 
prescribed radiation dose is often delivered as a number of (possibly overlapping) 
sub-tracks, each of which can be viewed as an interval on that track. For exam- 
ple, in Figure 1, we can treat Ij (resp., Ik) as an interval on the track T' (resp., 
T"). A radiosurgery system usually delivers a unit of radiation to a portion of a 
track as specified by one of its sub-tracks, with its two collimator leaves defining 
the two endpoints of such a sub-track. For two consecutive tracks, such as T' and 
T" in Figure 1, suppose that there are two sub-tracks, one on each track, whose 
intervals overlap (here, although the intervals are really on different tracks, we 
consider their overlapping as if the intervals were all on the x-axis). Then the 
two corresponding sub-tracks on the two consecutive tracks can be delivered by 
the radiosurgery system in the same time, thus shortening the total treatment 
time. (The overlapping among the sub-tracks on the same track does not help 
since they must be delivered one at a time.) Suppose we color the intervals on 
T' (resp., T") as red (resp., blue). Then one of the useful problems in treatment 
planning for intensity-modulated radiation therapy with multileaf collimation 
[3,12] is to obtain the maximum number of red/blue interval pairs such that 
the two intervals in each pair overlap. This gives rise to our maximum red/blue 
interval matching problem. 

We also need to study a slightly more general version of the maximum 
red/blue interval matching problem. The endpoints of the sub-tracks (intervals) 
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track T’ 
track T" 




Fig. 1. Illustrating an application of the red/blue interval matching problem. 



in the above radiosurgery treatment planning setting are specified by pairs of 
leaves of a collimator. Due to some mechanical constraints (e.g., for preventing 
two opposite collimator leaves on two consecutive tracks from colliding into each 
other and being damaged), a system-dependent threshold value d > 0 is actu- 
ally specified for the overlapping of any two red/blue intervals [11]. Precisely, the 
length of the overlapping portion of any two such intervals must be no smaller 
than the threshold value d. Hence, the threshold overlapping maximum red/blue 
interval matching problem is to find a maximum matching of n red/blue intervals 
subject to that two red/blue intervals can be matched only if the length of their 
overlapping portion is > d. Obviously, this problem is a generalized version of 
the maximum red/blue interval matching problem. 

We present an O(nloglogn) time algorithm for the maximum red/blue in- 
terval matching problem if the endpoints of the input intervals are already given 
sorted; otherwise, the algorithm takes 0(nlog n) time. Our algorithm is of a 
greedy nature, and is based on the plane sweeping technique [8] and some inte- 
ger data structures [9,10]. For the special case of a proper interval graph with 
sorted interval endpoints, we give an 0{n) time matching algorithm. We also 
show an easy 0(n) time reduction from the threshold overlapping maximum 
red/blue interval matching problem to the maximum red/blue interval match- 
ing problem for sorted input. 

The rest of this paper is organized as follows. Section 2 gives some needed 
preliminaries. Section 3 presents our maximum red/blue interval matching algo- 
rithms. Section 4 shows the reduction from the threshold overlapping maximum 
red/blue interval matching problem to the maximum red/blue interval matching 
problem. 

2 Preliminaries 

The input consists of a set of n red or blue intervals, / = {Ji,/ 2 , . . . ,/«}• The 
endpoints of the intervals in / are arbitrary numbers on the x-axis. Each is 
associated with a color C{Ii) G {red, blue}. To avoid cluttering the exposition, 
we assume without loss of generality (WLOG) that no two input intervals share 
the same endpoint (i.e., the 2n endpoints are distinct). Our algorithms can be 
easily modified for the general case. 

We first sort the 2n endpoints of I from left to right if they are not given 
sorted. The sorting takes O(nlogn) time. From now on, unless otherwise spec- 
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ified, we assume that the 2n endpoints of / are available in this sorted order 
and are stored in an array A. Further, we treat each interval endpoint of / as 
its rank in this sorted array A, i.e., we simply treat each interval endpoint as 
an integer in {1,2,..., 2n}. We also assume WLOG that the intervals in / have 
been relabeled such that i < j implies that re{i) occurs before re(j) in the sorted 
array A of endpoints. 

If the input interval graph is proper, then we assume that no interval in / 
contains another interval in I. 

Given a matching M in I, we say that an interval i is in M if i is matched by 
M. An interval is free if it is unmatched with respect to M . Interval i matching 
with j in M is denoted as mateM{i) = J - A matched pair of intervals i and j in 
M is denoted as match m { i , j)- 

It is known that when the keys of a priority queue are all from a universe 
{1,2, ...,to}, there is a data structure which supports each of the following 
operations in O(loglogTO) time [9,10]: Insert, Delete, Member, Empty, Min, 
Max, Predecessor, and Successor. Our algorithm makes use of such integer 
priority queues (with m = 2n). 



3 The Maximum Red/Blue Matching Algorithms 

Our maximum red/blue interval matching algorithm is based on the following 
key observation. 

Lemma 1. Let J he the interval in I such that its right endpoint re{J) is the 
smallest among the right endpoints of all intervals in I. WLOG, let C{J) = red 
(the case of C{J) = blue is symmetric). Then the following holds. 

(i) Lf J does not overlap with any blue interval in I, then J is free in any 
matching M in L. 

(ii) Lf J overlaps with any blue interval J' in I, then re{J) < re(J'). Further, 
let J" he the blue interval in L that overlaps with J and whose right endpoint 
re{J”) is the smallest among all blue intervals of L that overlap with J. Then 
for any maximum matching M* in I, there is another maximum matching M' 
in I such that mateM'{J) = J" ■ 

Proof: Since the other statements of this lemma are easy to show, we only prove 
that for any maximum matching M* in /, there is another maximum matching 
M' in / such that matcM'iJ) = J"- Let M* be any maximum matching in 
I. Note that it is impossible that both J and J" are free with respect to M* 
(otherwise, we can add {J, J”) to M*, a contradiction to the optimality of M*). 
WLOG, we assume that the pair {J, J") ^ M* (otherwise, M' = M*). We 
consider three cases: (1) J is free and J" is in M* , (2) J is in M* and J" is free, 
and (3) both J and J" are in M* (but (J, J") ^ M*). We will show that in each 
case, there is another maximum matching M' in I such that mateM'{J) = J" ■ 
It is easy to obtain a maximum matching M' in I from M* for Gases (1) 
and (2), as follows: Replace match m* {J' , J") by matchM'{J, J") in Gase (1), 
and replace matchM*{J, J') by matchM'{J, J") in Gase (2), where J' ^ {J, J"} 
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is the interval (with an appropriate color) that is matched with J” or J in M* 
in these two cases, respectively; let M' keep the other matched pairs of M* 
unchanged. 



J" 





Jr 



J 



Jb 



J 



Jb 



(a) (b) 

Fig. 2. Obtaining M' from M* by replacing matchM*{J, Jb) and 
matchM*{J" , Jr) by matchM'{J, J”) and matchM'{Jr, Jb) (the solid line 
segments are red intervals and the dashed line segments are blue intervals) . 



For Case (3), let mateM*{J) = Jb and mateM*{J'') = Jr, where C{Jb) = 
blue and C{Jr) = red. We aim to prove that Jb and Jr overlap (and hence can 
be matched). By the definitions of J and J" , we have re{J) < re{J") < re{Jb) 
(see Figure 2). This and the fact that J and Jb overlap imply that the interval 
[re( J), re( J")] on the x-axis is completely contained by Jb- Also, by the definition 
of J, we have re(J) < re{Jr). Hence, either re{J) < re{Jr) < re{J") (see 
Figure 2(a)) or re{J) < re{J") < re{Jr) (see Figure 2(b)) holds. In either 
situation, due to the fact that J” and Jr overlap, Jr must overlap with the 
interval [re{J),re{J")\. Hence, Jr and Jb must overlap (precisely, on at least one 
endpoint of the interval [re( J), re( J")]), and can be matched. Therefore, we can 
obtain a maximum matching M' in I from M* by replacing match m* (J, Jb) and 
matchM*{J” , Jr) by matchM'{J, J”) and matchM'{Jr, Jb), and letting M' keep 
the other matched pairs of M* unchanged. This finishes the proof of this lemma. 
□ 

Lemma 1 immediately implies the following greedy algorithm: Find intervals 
J and J" in / as defined in Lemma 1, let them form a matched pair, and solve 
the problem recursively on the remaining intervals in I — {J, J"}. Of course, 
an efficient way to carry out this greedy method is by using a non-recursive 
algorithm. Indeed, our greedy algorithm is based on the plane sweeping technique 
[8] and integer priority queues [9,10]. 

We first give an overview of our algorithm. It uses a sweep line to scan the 
endpoints of the intervals of / in the increasing order. It maintains two integer 
priority queues, Qn (for the red intervals) and Qb (for the blue intervals), such 
that all intervals in Qb or Qb are currently being intersected by the sweep line. 
Note that if at the same time, a red (resp., blue) interval J' (resp., J") is stored in 
Qr (resp., Qb), then J' and J" overlap and are possible to form a matched pair. 
An interval J is inserted into (resp., deleted from) the corresponding priority 
queue when its left (resp., right) endpoint le{J) (resp., re{J)) is encountered by 
the sweep line. 
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The details of our algorithm follow next. 

Algorithm Max-RB-Matching(/); 

1. Initialize the integer priority queues Qn and Qb as containing no key values. 
Let L be the sorted list of all endpoints of I in the increasing order. Let the 
output maximum matching in / initially be M* = <j). Mark every interval of 
/ as “free”. 

2. While L is not empty do 

(a) Remove an endpoint p from the head of L. Let J be the interval of / 
such that p is an endpoint of J. WLOG, assume that C{J) = red (the 
other case is symmetric). 

(b) If p is the left endpoint le{J) of J, then insert the right endpoint re{J) 
of J into the priority queue Qr. (Note that re{J) is not removed from 
the list L.) Go to the next iteration. 

(c) If J is already in M*, then go to the next iteration. 

Else, p is the right endpoint re{J) of J. Search in Qb for the immediate 
successor Succq ;^{ re{J)) of re(J) in Qb- 

If SuccQ^{re{J)) does not exist in Qb, then Qb contains no blue inter- 
val. Delete re(J) from Qr. (The interval J is not matched.) 

If SuccQg{re{J)) does exist in Qb, then let it be the right endpoint 
re{J") of a blue interval J". Add a matched pair {J, J") to the output 
maximum matching M*. Delete re{J) from Qr and re{J") from Qb- 
Mark both J and J” as “matched” . 

End of the while loop. 



Lemma 2. Algorithm Max-RB-Matching computes a maximum matching 
for n red and blue intervals in O(nloglogn) time, provided that the endpoints 
of the intervals are already given sorted. 

Proof: The correctness of the algorithm follows from Lemma 1. WLOG, we 
only consider the situation when the right endpoint re( J) of a red interval J is 
encountered by the sweep line, and further, we assume that J is currently not 
in M*. Note that all blue intervals of / that are currently being intersected by 
the sweep line and that are not in M* are stored in the priority queue Qb- If no 
blue interval is in Qb (as found by the search of SuccQg{re{J)) in Qb), then by 
Lemma 1, J should be free. If the immediate successor SuccQg{re{J)) of re{J) 
in Qb is obtained, then again by Lemma 1, (J, J") should be matched in M*, 
where J” is the blue interval having SuccQ^(re{J)) as its right endpoint. 

Analyzing the time complexity of Algorithm Max-RB-Matching is easy. 
Each of the n right endpoints of the n input intervals is inserted once into an 
integer priority queue and deleted once from such a queue, and it gives rise to at 
most one Successor search in such a queue. Since each of the Insert, Delete, 
and Successor operations takes O(loglogn) time [9,10], the total running time 
of the algorithm is O(nloglogn). □ 

We now consider the proper interval graph case (no interval in / contains 
another interval in I). 
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Lemma 3. A maximum matching for a set I of n red and blue intervals can 
be computed in 0{n) time, provided that the endpoints of the intervals in I are 
already given sorted and no interval in I contains another interval in I. 

Proof: This follows quite easily from Lemma 1. We maintain two sorted lists 
Ir and Ib, with Ir (resp., Ir) containing the red (resp., blue) intervals in / 
{Ir U /b = /). Note that since no interval in / contains another interval in I, we 
can order the intervals in Ir and Ir based only on their left endpoints. 

Our 0{n) time algorithm for computing a maximum matching in / is very 
similar to merging two sorted lists. We consider the first element J of Ir and the 
first element J' of Ir. If J and J' overlap, then by Lemma 1, we match J and 
J' and remove J from Ir and remove J' from Ir. Otherwise, J and J' do not 
overlap with each other. Assume WLOG that le{J) < le{J'). Then by Lemma 
1 again, we remove J from Ir (but J' stays in Ir). In either case, we repeat the 
matching process on the remaining elements of Ir and Ir, until one of Ir or 
Ir becomes empty. It is easy to see that since Ir and Ir are both sorted, this 
algorithm takes 0{n) time. □ 

4 Threshold Overlapping Maximum Red/Blue Matching 

As a generalization of the maximum red/blue interval matching problem, we 
solve the threshold overlapping maximum red/blue interval matching problem 
by reducing it to the maximum red/blue interval matching problem. Again, we 
assume that the endpoints of I are already given sorted. However, at this section, 
we need to use the original input values of the endpoints, not their integral ranks 
as in Section 3. 



J 




/Jr(d) 

\b(d) 




Fig. 3. Illustrating Lemma 4 (the dotted line segments are the cut-away por- 
tions). 



Let d > 0 be the specified threshold value for the minimum acceptable length 
of overlapping between any two red/blue intervals. The following simple obser- 
vation is a key to our reduction. 

Lemma 4. For any red (resp., blue) interval Jr (resp., Jb), let Jr{d) (resp., 
Jb{d)) be the interval obtained from Jr (resp., Jb) by cutting a length of d/2 
away from each end of Jr (resp., Jb). Then Jr and Jb have an overlapping 
length > d if and only if Jr{d) and Jb{d) overlap (see Figure 3). 
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Proof: Easy and omitted. □ 

Note that if an interval J oi I has a length < d, then applying to J the cut- 
away operation as defined in Lemma 4 creates an empty interval J{d) that we 
should throw away from further consideration (since J will never overlap with 
any interval in a length > d). We say that such an interval J is short. In fact, all 
short intervals of I are not good and should be identified, because the distances 
between their endpoints are less than the threshold value d. (Thus, they possibly 
cause the opposite collimator leaves on the same tracks, as well as on consecutive 
tracks, that define the endpoints of these and other intervals to collide into each 
other [11].) Hence, all short intervals of I are not useful medically and should be 
removed even before one considers solving the threshold overlapping maximum 
red/blue interval matching problem. WLOG, we assume that there are no short 
intervals in I. Let I{d) be the set of intervals obtained from I by applying the 
cut-away operation of Lemma 4. 

All that is left for us to do is to put the endpoints of /(d) into a sorted order. 
Note that the cut-away operations may cause the order of the endpoints of I to 
be different from the order of the endpoints of /(d), and we want to sort the 
endpoints of /(d) in 0(n) time. 

Let LE{I) (resp., LE{I{d))) be the list of left endpoints of I (resp., /(d)), 
and RE{I) (resp., RE{I{d))) be the list of right endpoints of / (resp., /(d)). 
Note that both LE{I) and RE{I) are already in a sorted order. Since the cut- 
away operations basically shift all points in LE{I) (resp., RE{I)) to the right 
(resp., left) by the same distance of d/2, LE{I{d)) and RE{I{d)) are in the same 
sorted order as LE{I) and RE (I), respectively. Hence, to create a sorted list for 
all endpoints of /(d), we simply merge LE{I{d)) and RE{I{d)), in 0{n) time. 
This finishes our reduction from the threshold overlapping maximum red/blue 
interval matching problem to the maximum red/blue interval matching problem. 

Lemma 5. In 0(n) time, the threshold overlapping maximum red/blue interval 
matching problem can be reduced to the maximum red/blue interval matching 
problem, if the endpoints of the n input intervals are already given in a sorted 
order. 
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Abstract. Let A be a set of n points in convex position in K®. The 
farthest-point Voronoi diagram of N partitions K® into n convex cells. 
We consider the intersection G{N) of the diagram with the boundary of 
the convex hull of N. We give an algorithm that computes an implicit 
representation of G{N) in expected O(nlog^n) time. More precisely, 
we compute the combinatorial structure of G{N), the coordinates of its 
vertices, and the equation of the plane defining each edge of G{N). The 
algorithm allows us to solve the all-pairs farthest neighbor problem for N 
in expected time O(nlog^n), and to perform farthest-neighbor queries 
on N in 0(log^ n) time with high probability. This can be applied to find 
a Euclidean maximum spanning tree and a diameter 2-clustering of N 
in expected O(nlog^n) time. 



1 Introduction 

Let A be a set of n points in three dimensions. Its diameter is the maximum 
distance between any two points in N. The problem of computing the diame- 
ter has been intensively studied in the past two decades. Indeed, back in 1985 
Preparata and Shamos [6] called it a source of frustration to many workers. 
After Clarkson and Shor [4] gave a simple randomized algorithm that runs in 
optimal expected 0(n log n) time, most work on the problem has concentrated on 
finding a matching deterministic algorithm. After considerable efforts by several 
researchers, Ramos [7] and Bespamyatnikh [3] achieved deterministic algorithms 
that run in O(nlog^n) time. Finally, Ramos [8] solved the problem in optimal 
time O(nlogn). 
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The all-pairs farthest neighbors problem for a set N of n points in three 
dimensions is to compute, for each point p in N, the point of N farthest from p. 
This natural generalization of the diameter problem has several applications [1]. 
While all-pairs nearest neighbors in fixed dimension d can be computed in op- 
timal 0(n log n) time [9], no algorithm with similar efficiency is known for the 
all-pairs farthest neighbors. Agarwal et al. [1] showed that 3-dimensional all-pairs 
farthest neighbors can be computed in log^^^ n) time, and pose closing 

the gap between this and the only lower bound of f?(nlogn) as a challenging 
open problem. 

Our work is motivated by the fact that two-dimensional all-pairs farthest 
neighbors can be computed in linear time if the points are the vertices of a 
given convex polygon, even though the problem has complexity l7(nlogn) for 
arbitrary points [2] . We consider the all-pairs farthest neighbor problem for a set 
of n points that form the vertices of a convex polytope in 3 dimensions, and show 
that it can be solved in expected 0(nlog^ n) time. This is the first algorithm on 
a restricted all-pairs farthest neighbor problem in three dimensions that runs in 
near-linear time. 

Our algorithm computes a data structure that is interesting in its own right: 
given a convex polytope V with m vertices, and n sites lying on its surface, 
we compute an implicit representation of the intersection of the farthest-point 
Voronoi diagram of these sites with the surface of V in expected 0{m\ogm + 
nlognlogm) time. 

Our result can be used to solve the following problems, as a direct application 
of Agarwal et al.’s results [1]: 

— Bi-chromatic farthest neighbors: Given a set i? of n “red” points and another 
set B oim “blue” points in 3 dimensions such that RJB is in convex position, 
we find for each red point r G R the farthest blue point from r in expected 
0((n -I- m) log^(n -I- m)) time. 

— External farthest neighbors: Given a set N of n points in 3 dimension in 
convex position and its partition Ni, N 2 , ■ ■ ■ , Nm into m subsets, we compute 
in expected 0(nlog^ n) time for each point p in fV, a farthest point in N\Ni, 
where p G Ni. 

— Euclidean maximum spanning tree: Given a set N of n points in 3 dimensions 
in convex position, we compute in expected 0{nlog^ n) time a spanning tree 
of N whose edges have the maximum total length among all spanning trees, 
where the length of an edge is the Euclidean distance between its endpoints. 
From this tree we can compute a minimum diameter 2-clustering of N in 
linear time. 

In the remainder of the paper, we consider only the original problem of 
computing all-pairs farthest neighbors for a set of points in convex position. 

2 Preliminaries 

Given a set N of points sites in and a point site s not necessarily in N , we 
define the (farthest-point) Voronoi cell Vor(s|A^) of s with respect to N as the 
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set of points a; G such that the Euclidean distance d{x, s) is larger than the 
distance d{x, s') to any site s' £ N with s' yf s. Voronoi cells are convex, and 
may be empty. The (farthest-point) Voronoi diagram of N is the partition of R.^ 
into the Voronoi cells Vor(s|Ai), for s £ N. 

Let now V be the boundary of a convex polytope in three dimensions. Let 
V be a set of point sites lying on V, and s a site on V not necessarily in N. 
The Voronoi cell Vor(s|A^) intersects V in a two-dimensional, possibly empty, 
Voronoi face Vor-p(s|A^). Our first observation is that Voronoi faces are simply 
connected. 

Lemma 1. Let V he the boundary of a 3-dimensional polytope, N a set of point 
sites on V , and s £ N. The Voronoi face Vor-p(s|V) is simply connected, that 
is, its boundary is a simple closed curve. 

Proof. Let p, q be two points in Vor-p(s|V). Let Cs{pq) be the two-dimensional 
cone with apex s spanned by pq, and let Ls{pq) be the intersection Cs{pq) O V. 
Ls{pq) is a path on V connecting p and q. We prove that Ls{pq) lies entirely in 
Vor-p(sjiV). 

In fact, let x £ pq. Since x G Vor(s|Ai), N lies in the sphere S with center x 
and passing through s. If we enlarge S by moving its center along the ray sx 
and keeping s on the sphere, N will remain inside the enlarged sphere. It follows 
that the entire portion of Cg{pq) not in the triangle spq is contained in Vor(s|7V). 
Since Lg{pq) lies in this portion, we have Lg{pq) C Vor-p(sjiV). 

Moreover, if we centrally project Lgfpq) from s, the result is a line segment. 
Since for any two points p, q in Vor-p(s|fV) we have Lg{pq) C Vor-p(s|A^), this 
implies that the central projection of Vor-p(s|fV) from s is convex, and therefore 
simply connected. 

It follows that a set of sites N on V partitions V into simply connected 
faces, defining a planar graph that we denote as G{N) = G-p{N). A face of 
G{N) is a Voronoi face, a vertex of G{N) is a point of equal distance from three 
sites, and therefore the intersection of an edge of the three-dimensional Voronoi 
diagram with V. There can be at most two vertices defined by the same three 
sites (and this case can indeed arise). An edge of G{N) separates two Voronoi 
faces Vor7:5(s|fV) and Vor7:5(s'|A^), and therefore lies on the bisecting plane of the 
sites s and s'. 

Theorem 1. Let N be a set of n sites on a polytope V. Then G{N) = G'p(N) 
has 0{n) vertices, edges, and faces. 

Proof. It follows from Lemma 1 that the number of faces is at most n. A vertex 
has degree at least three. Euler’s formula now implies the linear bound on the 
number of vertices and edges. 

The embedding of an edge e of G{N) in P is a polyline whose vertices are the 
intersections between the embedding of e and the edges oiV. liV has m edges, 
the embedding of e consists of at most m segments. The overall complexity of 
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Fig. 1. An example of G{N) with complexity of I7(n^). V is the convex hull 
of N. (a) If one puts n/2 points equidistantly on the boundary of the bottom 
disk, then their Voronoi cells partition the top disk equally, (b) Add a convex 
roof slightly above the top disk which consists of the remaining n/2 points. The 
number of intersections of the Voronoi cells of the points on the bottom disk 
with edges of the roof becomes 17 (n^). The fat edge of the roof intersects the 
shaded Voronoi cells. 



the embedding of G{N) is therefore 0{nm). This bound is tight, as the example 
of the modified n-Camembert in Fig. 1 shows. 

To achieve subquadratic time, we cannot work with the explicit embedding 
of G{N) into V. Instead, we will use a linear size representation of G{N). The 
representation stores the adjacency relations between vertices, edges, and faces 
of G{N). By Theorem 1 this has linear complexity. In addition, we record for 
each face the defining site, and for each vertex the coordinates of its embedding 
into V. 

We need the following lemma. 

Lemma 2. Let N be a set of sites on a polytope V, and let v be a vertex of 
G{N) = G'p(N). If the sites defining the faees around v are known in the eorreet 
order, then the coordinates of v ’s embedding into V can be computed using a 
single ray shooting query on V . 

Proof. The sites define a line of equal distance, which intersects V in two points. 
The two points differ in the order of sites. The order of sites thus orients the 
line, and we can find the correct intersection point with V using a ray shooting 
query from infinity. 

As a final observation, note that this approach to computing the farthest- 
point 

Voronoi diagram would fail if we tried to apply it to the nearest-point Voronoi 
diagram instead. In fact, the intersection of the nearest-point Voronoi diagram 
of n sites N with the boundary of the convex hull of N can have a quadratic 
number of faces; such an example can be easily designed. 

In the following we will assume N to be in general position, which means 
that no five sites lie on a sphere and no four points lie on a circle. 
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3 The Strategy to Compute G{N) 

Given a convex polytope V with m vertices and a set N of n point sites on 
the surface V, we show how to compute G{N) = G'p{N) in expected time 
0 {m log m + n log n log m). 

The first step of the algorithm is to compute, in O(mlogm) time, a data 
structure that permits ray shooting queries in V with query time 0 (logm). 

We then compute G{N) by randomized incremental construction. We choose 
a random permutation si, S2, S3, . . . , of N, and insert the sites in this order. 

Let TV* = {si, S2, ...Si} be the set of the first i inserted sites. The algorithm 
maintains the implicit representation of G{N^) while sites are added one by one, 
resulting in G(iV”) = G{N). Note that the polytope V defining G(W) does not 
change during the course of the algorithm. 

Our algorithm is mostly a straightforward implementation of the randomized 
incremental paradigm using a conflict graph, and most of the lemmas below are 
analogous to those proven, say, in Mulmuley’s book [5]. We do, however, need 
to cope with an unpleasant aspect of our diagram concerning the maintenance 
of the conflict graph. As we will see, conflicts can “jump” to another edge. We 
need O(logm) time to check a conflict between a site and an edge, which results 
in overall 0(n log n log m) expected time for the computation of G{N). 

4 Conflicts 

A vertex v of G(N*) is said to be in conflict with a site s € iV \ if s is 
farther from v than any of the sites that define v, that is, the sites whose faces 
are adjacent to v. This is equivalent to u G Vor-p(s|iV*). Similarly, an edge e of 
G{N^) is said to be in conflict with a site s G \ fV* if e fl Vor-p(s|A^*) yf 0. 

In addition to a representation of G(A^*), our algorithm maintains a conflict 
list: for each not-yet-inserted site s G N \ N'^, we keep a bidirectional pointer 
to a single vertex A(s) of G{N'‘) in conflict with s. If no vertex of G{N'‘) is in 
conflict with s, we set AT(s) to a single edge of G(A^*) in conflict with s. If no 
edge of G(A^*) is in conflict with s either, then Ai(s) := 0. 

Lemma 3. Let s € N \ A^L // Vor-p(s|A^*) is not empty, then the vertices and 
edges of G{N'^) in conflict with s form a connected subgraph of G{N'^) . 

Proof. Suppose the vertices and edges are not connected. Then we can separate 
them using a curve 7 contained in Vor-p(s| that cuts Vor-p(s| A*) into two non- 
empty components without intersecting vertices or edges of G(A*). This means 
that 7 is entirely contained in a face Vor-p(s'|A*) of G(A*). Since Vor-p(s'|A*) 
is simply connected by Lemma 1, 7 cannot be a closed curve. The endpoints 
of 7 lie in Vor-p(s'|A*) \ Vor-p(s|A*) = Vor-p(s'|A* U {s}). Since this set is con- 
nected by Lemma 1, there is a path 7' connecting the endpoints of 7 through 
Vor7:5(s'| A*) \ Vor73(s|A*). The concatenation of 7 and 7' is a closed curve con- 
tained in Vor-p(s'|A*). It separates the edges and vertices of G(A*), a contra- 
diction to Lemma I. 
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We consider now the insertion of the z + I’st site s^+i into the data structure 
storing G(W). 

Lemma 4. During the insertion o/s^+i, one of the three following situations 
occurs (see Fig. 3): 

(i) If Si+i has no conflicting vertex and no conflicting edge in G{N'^), then 
Vorp(s*+i|W) = 0 and G(W+i) = G(W)- 

(ii) If Si+i has only one conflicting edge e and no conflicting vertex in G{N'‘), 
then Vor 73 (si_|_i |fV*) fi G{N'‘) is a connected portion pq of e. 

(iii) If Si+i has at least one conflicting vertex in G{N'^), then at least one endpoint 
of each edge in conflict with s^+i is also in conflict with s^+i. 

Proof. If Si+i has no conflicting vertex and no conflicting edge in G(iV*), then 
Vor7:5(si_i_i |iV®) must be empty; otherwise, Vor-p(si+i| would lie entirely within 
some face of G(./V*), which is impossible since the face is simply connected in 
G(iV*+^). Thus if Vor73(si+i|W) 0, then there must be a vertex or an edge of 
G(fV*) in conflict with 

Suppose now that an edge of G(iV*) conflicts with Si+i, but that none of 
its endpoints conflicts with Si+i. Then Lemma 3 directly implies that no other 
vertex or edge of G{N'‘) is in conflict with Sj+i. 

The third case trivially holds because the vertices and edges in conflict with 
Si+i form a connected subgraph of G(W) by Lemma 3. 





(a) (b) 

Fig. 2. (a) G(iV^). No vertex of G{N^) is in conflict with 54 , but edge e conflicts 
with S4. (b) G{N'^). 



The algorithm implements the three cases of Lemma 4. Consider the insertion 
of Si+i into the data structure for G(W). We first follow the conflict pointer of 
Si+i to find the conflict A'(si+i) G G(fV*) in constant time. 

If AT(si_|_i) = 0, we have case (i) of the lemma, and nothing needs to be done. 
If Ai(si+i) is an edge, no vertex of G(fV*) is in conflict with s^+i. We have 
therefore case (ii) of the lemma, and there is no other conflict of at all. We 
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can update our data structure to represent by removing a portion of e 

and replacing it with an eye-like subgraph that consists of two edges induced by 
the two bisector planes between Sj+i and each of the two sites defining e in iVb 
See Fig. 3. Updating the adjacency relations takes constant time. However, we 
also need to compute the coordinates of the two new vertices. By Lemma 2, this 
can be done by two ray shooting queries on V in time O (log to). 

If X(si+i) is a vertex, we have case (iii) of the lemma. By Lemma 3, the por- 
tion of G(iV*) lying inside Vor-p(si+i |fV*) is a connected subgraph G of G(fV*). 
We identify G by traversing G(fV*). This takes time linear in the size of G, as 
we only need to test conflicts between vertices and s^+i, which takes constant 
time per test. The extremal edges of G lie on the boundary of the new face 
Vor-p(si_|_i|iV*). We shorten these edges by creating new vertices. After generat- 
ing the new boundary by connecting these vertices, we finally delete G. If the 
complexity of G is k, all this can be done in time O(klogm) using Lemma 2. 
See Fig. 3. 





Fig. 3. Illustrating two cases of Lemma 4. 



5 Conflict Maintenance 

It remains to see how we update the conflict list during the insertion of a new 
site Si+i. Recall that we maintain for every site s G N\N'^ a bidirectional pointer 
to a vertex or edge of G{N'^) in conflict with s, if there is one. More precisely, 
if there are vertices of G(W) in conflict with s, then X(s) is one of them. If 
no vertex of G{N'^) is in conflict with s and Vor-p(s|fV*) yf 0, then there must 
be an edge e C G{N'^) in conflict with s by Lemma 4 (ii). In this case, we set 
A(s) = e. Notice that if Ai(s) is an edge, then no vertex in G(N*) conflicts with 
s (see Lemma 4 (ii)). 

When inserting Si+i, a new face Vor-p(si+i |fV*) is defined. If Vor-p 
0, then the vertices and edges in conflict with will be destroyed in G{N'^~^^). 
These are exactly the vertices and edges of G(iV®) that intersect Vorp(s 
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Let s be a non-inserted site in TV \ If AT(s) intersects Vor73(s 

then X(s) is not defined in G(iV*+^) any more, and we need to update A'(s) 
by finding a vertex or edge of in conflict with s. Otherwise, namely if 

AT(S') does not intersect Vor-p(si+i|-/V*), then we do not need to update AT(s). 
The next lemmas will be used for such conflict update. The boundary of a set 
R is denoted by dR. 

Lemma 5. Suppose that s £ N \ and vA(s) 0 Vor-p(si+i|iV*) yf 0. If 
Vor-p(s|iV*) C Vor-p(sj+i|Ai*) then Vor-p(s|fV*+^) = 0. Otherwise Vor-p(s|Al*) 0 
(9Vor-p(si+i|Ai*) C Vor-p(s|Ai*+^). 

Proof. If Vor-p(s|A^*) C Vor-p(sj+i|A^*), then all the vertices and edges of G(N^) 
lying inside Vor-p(s|Ai*) would be destroyed at the end of step i + I. It is equiv- 
alent to say that no vertex and no edge is in conflict with s over By 

Lemma 4 (i) it follows that Vor-p(s|A^*+^) is empty. 

Let now x S Vor-p(s|Ai*) 0 9Vor-p(si_|_i |iV*). Since x G 9Vor-p(si+i |7V*), 
there is some s' £ N' such that d{x,Si+i) = d{x,s'). However, since x G 
Vor-p(s|iV*), we have d{x,s) > d{x,s'), and so d{x,s) > d(x, Sj+i). This im- 
plies X £ Vor-p(s|A^*+^). 

A point p is said to be visible from a point q within G{N') if there is a path 
connecting q to p whose interior does not intersect any vertex or edge of G(fV*). 
Similarly, an edge e is said to be visible from a point q within G(iV*) if there is 
a point on e that is visible from q. 





Fig. 4. The proof of Lemma 6 (ii). (a) A'(s) is a vertex, (b) AT(s) is a portion of 
an edge. In this example, z = a and x = b. 



Lemma 6. Suppose that s is a non-inserted site in N \ and AT(s) fl 

Vor-p(si+i|iV*) 

yf 0. If\or'p{s\N'''^^) yf 0, then one of these two cases must occur: 

(i) s has a conflicting vertex v G Vor-p(s|iV®) fl 9Vor-p(si_|_i |iV*) in G{N'^^). 
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(ii) S has no conflicting vertex in Vor-p(s| n 9 Vor-p(si+i |./V*), but has a single 
conflicting edge e € dVorp(si+ijN‘) such that e is visible from a point p € 
G{N^) n Vor 73(54+1 1 n Vor73(s|A^*) within G{N'^). 

Proof. By Lemma 5 , each point of Vor-p(s|A^*) fl 9 Vor-p(si+i|A^*) is in conflict 
with s. Deflne G = Vor73(s|A^*) fl 9Vor73(si+i|Ai*). Then C is a connected chain 
due to Lemma 1 . If C contains a vertex of G{N'^''^^, case (i) is true. 

If C is a portion of an edge e of (9Vor73(si+i|A^*), we need to prove that e 
is visible from some point p G G{N^) fl Vor 73 (si+i |A^*) fl Vor 73 (s|A^*). For the 
following description, see Fig. 4 . 

Let z be a point of G. Let x be a point of Ai(s) fl Vor73(si+i |A^*). Note here 
that if X(s) is a vertex, then x = Ai(s); if Ai(s) is an edge, x is any point 
of Ai(s) which belongs to Vor-p(si+i|Ai*). Consider an arbitrary simple path 7 
connecting a; to z within Vor73(s|A^*) (not within Vor73(s| Since x lies 
in Vor-p(si+i|A^*) and z does not lie in the interior of Vor-p(sj+i|Al^*), 7 must 
intersect 9 Vor-p(si+i|Ai*) at least once, possibly in z. We denote by a the first 
such intersection on the way from x to z. Let e be the edge of clVor-p 
containing a. 

Let b be the last intersection of G(A^*) with 7 on the way from x to a. If 
b = X, then it means e is an edge bounding some region in G(N^) incident to 
X, so e is visible from x, that is, p = x. li b ^ x, Lemma 4 (iii) implies that an 
endpoint p of the edge of G{N'^) containing b conflicts with s. It means that a is 
visible from p, so e is visible from p, and e intersects 7. Hence e conflicts with s 
in G(Ai*+i). 



Lemma 7 . For a site s € N \ and a given vertex or edge of G{N'‘), we can 
decide in O(logm) time whether they are in conflict. 

Proof. A vertex w G G{N'^) is in conflict with s if and only if w is farther from 
s than the three sites in N'^ that deflne w. This can be checked in constant 
time because we have access to the coordinates of w and the sites defining the 
adjacent faces. 

Suppose now that we want to check whether an edge e is in conflict with a 
site s. Let s', s" G be the sites defining the faces adjacent to e, and let tt be 
the bisecting plane of s', s" . The edge e is embedded in tt, and is in conflict with 
s if and only if it intersects the halfplane tt' := {x G tt | d{x, s) > d{x, s')}. We 
test this by ray shooting on V in time O(logm). 

We are now ready to describe how the conflict list is updated during the 
insertion of a new site s^+i. We first collect all not-yet-inserted sites s whose 
conflict is destroyed by the insertion of s^+i. This can be done during the ex- 
ploration of the subgraph. Since A(s) is being deleted in G(N*+^), we need to 
either And a new conflicting object in G{N'^^), or to And out that there is none, 
which means that Vor73(s|Af-^ ) = 0 for all j > f -I- 1 by Lemma 5 . 

Suppose that Vor-p(s|A^*“'"^) is not empty. By Lemma 6, we can And a new 
vertex or edge in conflict with s by exploring the subgraph of G{N') belonging 
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to Vor-p(s|Ai*) n Vor-p(sj+i|Ai*) as follows. If Al(s) is an edge, it may have a 
conflict with a new vertex of 9Vor7i7(si_i_i |fV*) that is defined in the interior of 
X(s) n dVor-p(si+ijJV^) or a new edge that is visible from any point in A'(s) fl 
Vor-p(si_|_i|iV*). In the latter case, there can be at most two such edges, thus we 
can check if the edges conflict with s. If they do not, then s can be discarded 
since its Voronoi region is empty. 

On the other hand, if Al(s) is a vertex, then we first look for a new conflicting 
vertex by walking among the vertices of G{N'^) in conflict with s until we reach a 
vertex of 9Vor7:5(si+i|Ai*). If we do not And such a vertex, we still have to check 
if there is a new conflicting edge of i9Vor7:5(si+i|A^*) which is visible from some 
vertex visited so far (see Lemma 6 (ii)). If we fail to And a conflicting edge, then 
s can be discarded. 

6 Analysis 

Two of our primitive operations — creating a Voronoi vertex and detecting con- 
flict with an edge — require ray shooting query on V, and therefore take O(logm) 
time. All other primitive operations take constant time. After the initial prepro- 
cessing in time O(mlogm), a backward analysis argument (see pp. 98-108 in [5]) 
shows that our algorithm performs an expected number of 0(n log n) primitive 
operations. This implies a running time of 0(mlogm + nlognlogm). 

Theorem 2. Given a polytope V with m vertices, and n points N on the sur- 
face of poly, we can compute G{N) = G'p(N) in expected time 0{mlogm -\- 
n log n log m) . 

7 Point Location Queries and Farthest Neighbors 

Once we have computed the implicit representation of the farthest-point Voronoi 
diagram of N on V, we can easily And a point in the interior of each face 
Vor-p(s|iV). For example, take two non-adjacent vertices in an ordinary face, 
or the two vertices of an eye, and shoot a ray from s towards their midpoint. 
Then, run the same algorithm again, but now computing a radial triangulation 
of G{N) (see pp. 109 in [5]). Each of its faces is the intersection of V with a 
cone (ss' , sv, sv') where s G N, s' is the interior point for Vor-p(s|fV), and v,v' 
are consecutive vertices of Vor-p(s|V). During the second pass, we maintain for 
each site, whether it has been inserted or not, a pointer to the face of the radial 
triangulation that contains it. One can see easily that this does not hurt our time 
bound. It allows to And for each site the face of G{N) it belongs to. Still follow- 
ing [5] we can build a point location data structure for our radial triangulation 
that answer queries in 0(log^ n) time with high probability. The reason why 
we need the radial triangulation for these two results is a configuration space 
argument, namely we want the faces of our graph to have bounded degree. 

Theorem 3. Given a set N of points in 3-dimensional convex positions, all- 
pairs farthest neighbors can be computed in expected O(nlog^n) time. A data 
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structure that answers farthest-neighbor queries in 0(log^ n) time with high prob- 
ablility can be build within the same time bounds. 

8 Conclusion and Open Problems 

The main open problem is to fill the gap between the time complexity of our 
algorithm, namely O(nlog^n), and the lower bound Q{n). Besides, one can try 
to find a deterministic equivalent, but this looks hard. Indeed, a simple 0{n log n) 
algorithm [4] for the diameter problem has been found in 1989 but a deterministic 
O(nlog^n) [7] time algorithm was only published in 1997. 
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Abstract. Let 7r(a, b) denote the shortest path between two points a, b 
inside a simple polygon P, which totally lies in P. The geodesic distance 
between a and 6 in P is dehned as the length of Tr{a,b), denoted by 
gd{a, b), in contrast with the Euclidean distance between a and b, denoted 
by d{a, b). Given two disjoint polygons P and Q in the plane, the bridge 
problem asks for a line segment {optimal bridge) that connects a point p 
on the boundary of P and a point q on the boundary of Q such that the 
sum of three distances gd{p' ,p), d{p, q) and gd{q,q'), with any p' £ P and 
any q' £ Q, is minimized. We present an 0(n log® n) time algorithm for 
finding an optimal bridge between two simple polygons. This significantly 
improves upon the previous 0{n^) time bound. 



1 Introduction 

The bridge problem, introduced by Cai et al. [2], is defined as: Given two dis- 
joint regions, we want to build a bridge to connect them so as to minimize the 
length of the longest path between two points in different regions. Because of 
its similarity to many geometric problems (e.g., diameter problems, minimum 
separation problems and minimum spanning tree problems), much progress has 
been made, immediately after the problem is posed. For the simple case where 
two polygons are convex, two optimal algorithms have been presented by Kim 
and Shin [5] and by Tan [8], independently. Kim and Shin also reported an O(n^) 
time algorithm for finding an optimal bridge between two simple polygons [5]. 
A linear time solution to the case of two rectilinear polygons is given in [10]. 
Efficient algorithms for high-dimensional bridge problems can be found in [8,9]. 

In this paper, we present an 0(nlog^ n) time algorithm for finding an optimal 
bridge between two simple polygons with a total of n vertices. Our result is 
obtained by making use of a hierarchical structure that consists of the well- 
known segment trees, range trees and persistent search trees [6,7], and a structure 
that supports dynamic ray shooting and shortest path queries as well [3]. 

2 Necessary Conditions for Optimal Bridges 

Let 7r(a, b) denote the shortest path between two points a, b inside a simple 
polygon P, which starts at a, ends at b and totally lies in P. The length of 
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7r(a, b) is the geodesic distance between a and b in P, and is denoted by gd{a, b). 
Usually, the Euclidean distance between a and b is denoted by d{a, b). 

Let P and Q denote two disjoint polygons in the plane with boundaries dP 
and dQ, respectively. The bridge problem considered in [2] deals with finding a 
line segment that connects a point p £ dP and a point q € dQ so that the length 
function maXpi^ p{gd{p' , p)} + d{p, q) + maXqr^Q{gd{g, q')} is minimized. We call 
the segment (p,q) the optimal bridge between P and Q. 

We will first review necessary conditions for optimal bridges between two 
convex polygons given in [8], and then generalize them to simple polygons. 

2.1 The Case of Two Convex Polygons 

Notice that in the case of convex polygons, gd{p',p) = d{p',p) and gd{q,q') = 
d{q, q'). Let V (P) denote the furthest-site Voronoi diagram of vertices of polygon 
P and V(pi) the furthest-site Voronoi region of a vertex pi [6]. Let pij denote 
the intersection point of dP with the common edge of regions V{pi) and V{pj). 
Define the infinite triangular region, bounded by two half lines being collinear 
with but not containing the segments {pi,pij) and (pj,pij), as the fan of pij, 
and denote it by F{pi,j) (see Fig. lb). The fan F{pij) specifies the area to which 
the candidate bridges with p = pij belong (see Lemma 1). Also, let pn and Pi 2 
be two intersection points of dP with the boundary of a region V (pi). Define the 
fan of the vertex pi, denoted by F{pi), as the triangular region bounded by two 
half lines passing through and containing the segments {pi,pn) and {pi,Pi 2 )- The 
fan F{pi) specifies the area to which the candidate bridges with p' = pi belong. 
The same definition also applies to polygon Q. 

Lemma 1 The points p and p' of an optimal solution must satisfy one of the 
following conditions (the same conditions symmetrically hold for the points q 
and q' ): 

(1) p is the intersection point of dP with the common edge ofV{pi) and V{pj), 
and the fan F{p) contains the other point q of the optimal bridge, or 

(2) p is the intersection point of dP with the segment {p', q) and the fan F(p') 
contains the point q, where q satisfies the condition (1), or 

(3) p is the intersection point of dP with the segment (p' , q'), and the fans F(p') 
and F{q') contain the vertices q' and p' , respectively. 

Proof. We will give a simple proof of the condition (1) and omit the proofs 
of conditions (2) and (3) (see also [8]). 

Consider first the case where any two of segments (p',p), (p,q) and {q,q') 
of an optimal solution are not collinear. Assume that vertex pi is the point p' 
of the optimal solution. Obviously, a contradiction occurs if pi is not the vertex 
farthest to the point p of the optimal bridge in polygon P. If pi is the only 
vertex farthest to the point p in P (Fig. la), or pi and another vertex pj are the 
vertices farthest to the point p (= pij) but the point q of the optimal bridge 
is not contained in the fan Ffp^ j) (Fig. lb), we can move p by a very small 
distance to obtain a new point p* such that p* and q are mutually visible, pi or 
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Fig. 1. Illustration for the proof of Lemma 1 . 



Pj is still the vertex farthest to p* , and d{pi,p*) + d{p*,q) < d{pi,p) + d{p, q) or 
d{pj,p*) + d{p*,q) < d{pj,p) + d{p,q), a contradiction. Therefore, the points p 
and p' of the optimal solution satisfy the condition (1). □ 



2.2 The Case of Two Simple Polygons 

Given a simple polygon P' of m vertices and a set S' = {si, . . . , Sfc} of k sites 
inside P', the farthest-site geodesic Voronoi diagram of S in P' is the partition 
of P' into k regions GV{si), . . . ,GV{sk) (some of them may be empty) such 
that a point x G P' belongs to GV{si) if and only if the geodesic distance of 
site Si to X is farther than that of any other site to x. An 0((m + k) log(m + k)) 
time algorithm has been presented to compute the farthest-site geodesic Voronoi 
diagram of k sites in a simple polygon of m vertices [1] . 

Let GV (P) be the farthest-site geodesic Voronoi diagram of vertices of the 
simple polygon P. Let Pi, Pj denote two vertices of P such that their regions, 
GV{pi) and GV{pj), are adjacent, and pij the intersection point of dP with 
the common edge of GV{pi) and GV{pj). Similarly, the fan F{pij) needs to be 
defined. Let L\ and L 2 be two infinite rays shot at pij, being collinear with 
but not containing the first segments of two paths Tr{pij,pi) and Tr(pij,pj), 
respectively, and let T{Li,L 2 ) denote the triangular region bounded by Li and 
L 2 (see Fig. 6). Since parts of polygon P may appear in T{Li,L 2 ), i.e, they 
block the visibility of the point pij, region T{Li,L 2 ) cannot simply be defined 
as the fan F{pij). Let [Pij,Pi j] be the maximal interval on dQ such that p\ ^ 
and pF He in T{L\, L 2 ) and both of them are visible from pij. We then define 
the fan F(pij) as the triangular region bounded by two half lines being collinear 
with and containing the segments (pij^Pij) and {pij,pFf 

For a non-empty region GV{pi), we further divide the boundary of GV{pi), 
overlapped with dP, into the intervals such that all points within an interval 
have (or are visible from) the same first turn point in their shortest paths to pi. 
See Fig. 2 for an example. Note that a first turn point may have several intervals. 
In [I] , Aronov et al. referred to such a first turn point as the anchor of all points 
of the interval with respect to pi, and defined a refined Voronoi diagram so that 
all points in a refined region have the same furthest vertex and the same anchor 
with respect to the furthest vertex. 
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Fig. 2. First turn points (anchors) with their intervals. 



Let Pi{a) be an anchor with respect to pi, and [pi{ai),pi{a 2 )] one of its in- 
terval(s). See Fig. 2. To define the fan F{pi{a)) with respect to the interval 
[pi{ai),pi{a 2 )], we modify dP by removing the interval [pi{ai),pi{a 2 )], and by 
inserting the edges (pi{a) , pi{ai)) and {pi{a),pi{a 2 ))- The definition of F{pi{a)) 
is then similar to that of F{pij), with a slight modification that T{Li,L 2 ) is 
replaced by the region bounded by two half lines being collinear with and con- 
taining (pi{a) , pi{ai)) and {pi{a),pi{a 2 ))- (An anchor has as many fans as the 
intervals it has.) The same definitions also apply to polygon Q. 

Lemma 2 Let p' {a) and q'{a) denote the anchors of p and q with respect to 
p' and q' , respectively. The points p and p' of an optimal solution must satisfy 
one of the following conditions (the same conditions symmetrically hold for the 
points q and q' ): 

(1) p is the intersection point of dP with the common edge of GV{pi) and 
GV{pj), and F{p) contains the other point q of the optimal bridge, or 

(2) p is the intersection point of dP with the segment (p'{a),q) and F{p'{a)) 
contains the point q, where q satisfies the condition (1), or 

(3) p is the intersection point ofdP with the segment (p'{a), q'(a)), and F{p'{a)) 
and F{q'{a)) contain the anchors q'{a) and p' {a) respectively, or 

(4) {p,q) is a line segment that touches a vertex v of P or Q, and three points 
p'{a), q'{a) and v are in the same side of the line passing through (p,q). 

Proof. Consider first the case where the optimal bridge {p, q) is not collinear 
to the first segment of the path tt{p,p') or Tr{q,q'). Assume that vertex pi is 
the point p' of the optimal solution. In the case where Pi is the only vertex of P 
farthest to the point p in the geodesic metric, or pi and another vertex pj are the 
vertices farthest to the point p (= pi^j) but the point q of the optimal bridge is not 
contained in the fan F{pij), a contradiction occurs, provided that point p can be 
moved by a small distance to get a smaller value of gd{p ' , p) + d{p, q) + gd{q, q') . 
The condition (1) is satisfied in this case. 

The conditions (2) and (3) can be proved by an argument similar to the 
proof of Lemma 1 (see [8] for a full proof), with a slight modification that p is 
the intersection point of dP with the segment (p'{a),q) or {p' (a) , q' (a)) . 

Since polygons P and Q are simple, the bridge (p, q) may touch a vertex v 
of P or Q. See Fig. 3. (In this case, point p cannot simply be moved to give a 
contradiction.) Without loss of generality, assume that u is a vertex of P. Let pi 
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Fig. 3. A simple case in the rotation of the bridge {p, q) . 



(resp. qj) denote the vertex of P (resp. Q) farthest to the point p (resp. q) in the 
geodesic metric, Pi{a) (resp. qj{a)) the anchor of p (resp. q) with respect to Pi 
(resp. qj), and q{pi{a)) (resp. p{qj{a))) the intersection point of dQ (resp. dP) 
with the line passing through {v,pi{a)) (resp. (v,qj{a))). See Fig. 3 for an exam- 
ple. Obviously, the optimal bridge cannot be located at the position of (p*,q*), 
where p* (or q*) is not contained in the triangle determined by Pi{a), p{qj{a)) 
(or qj{a), q{pi{a))) and w; otherwise, d{p^{a),p{qj{a))) + d{p{qj{a)),qj{a)) < 
d{pi{a),p*) + d{p*,q*) + d{q* ,qj{a)), a contradiction (Fig. 3). Consider a pro- 
cess of rotating the bridge {p, q) around point v, with the point p contained 
in the triangle described above. Assume first that nothing is encountered in 
the process of rotating the bridge (p,q), i.e., it begins with p = p{qj{a)) and 
ends with q = q{pi{a)). The sum d{pi{a),p) + d{p,v) is monotone decreasing in 
the rotating process, while the sum d{v,q) + d{q,qj{a)) is monotone increasing. 
Therefore, the length function d{pi{a),p) + d{p,v) + d{v,q) + d{q,qj{a)) as well 
as gd{p',p) + d{p,q) + gd{q,q') is convex with a unique local minimum. If the 
minimum occurs at point p* in the rotating process, then (p*, q*) is a candidate 
bridge. This gives the condition (4). 

In the rotating process, the bridge (p, q) may touch two vertices of P or/and 
Q, and the vertex v of polygon P or the anchor qj{a) may be changed. Consider 
first the situation where the endpoint q of the current bridge reaches the endpoint 
of qj{a)’s interval on dQ (Fig. 4a). In this case, the anchor of qj is changed to a 
new one, say, q'j{a) in Fig. 4a. (The vertex qj needn’t be changed in the rotating 





Fig. 4. Two more cases in the rotation of the bridge (p, q). 
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process; otherwise, point q were not contained in the triangle described above.) 
In the situation where the current bridge (p, q) makes a tangent contact with a 
portion (or two edges) of Q, the candidate bridge should be found before or at 
the tangent contact (Fig. 4b). In the situation where the current bridge (p,q) 
touches another vertex v' of polygon P, the center of the rotation is changed 
from V to v'. In all cases, the rotated ranges on both dQ and dP are continuous 
Therefore, the length function considered in the process of rotating the bridge 
(p,q) is convex, and thus we obtain the condition (4). □ 



3 The Algorithm 

Two types of the fans defined in Section 3.2 are dealt with by the following two 
lemmas, respectively. 

Lemma 3 Let pi, pj denote two vertices of P such that GV{pi) and GV{pj) 
are adjacent, and pij the intersection point of dP with their common edge. All 
fans F{pij), for pi,pj G P, can be computed in O(nlog^n) time. 

Proof. Before describing our algorithm, we briefly review the result of Good- 
rich and Tamassia on dynamic ray shooting and shortest path queries [3]. For 
a connected subdivision of the plane into simple polygons (i.e., no “islands” ex- 
ist), one can maintain a data structure of size 0(n) that supports query time of 
0(log^ n) for a ray shooting or shortest path in a region of the subdivision (plus 
0{k) if the path of length k is to be reported), and update time of 0(log^ n) for 
each addition/deletion of an edge/ vertex. 

To use dynamic ray shooting and shortest path algorithms, we construct a 
planar subdivision that always consists of a bounded region and an unbounded 
region. The bounded region, denoted by PQ, is an approximation of the exterior 
of the union P U Q. The region PQ is initially constructed as follows. Let a 
be the highest point among all vertices of P and Q. Place two points b and c 
sufficient below and to the left and right of the union of P and Q, respectively, 
and connect three points a, b, c so that P and Q are contained in the triangle 
Aabc- Make a copy a' of the vertex a. Then, find a point p* on dP and a point 
q* on dQ such that they are visible each other. Split each of p* and q* into two 
and insert two edges: one connects vertices p* and q*, and the other does their 
copies. The resulting (simple) polygon PQ turns the original polygons P and Q 
inside out (Fig. 5). 

It is easy to see that a ray shooting query in the exterior of the union PU Q 
can be performed in PQ with a slight modification that the ray hits nothing 
if the hit point is on the boundary of Aabcj and an extra ray shooting needs 
to perform if the hit point is on the pseudo-edge {p*,q*) or its copy. Special 
attention should be paid to the shortest path queries. This is because a shortest 
path in the exterior of the union P \J Q may require to go across the pseudo- 
edges or even wrap around the highest vertex a. When a shortest path query is 
performed in PQ, we first find a new pair of mutually visible points, one on dP 
and the other on dQ, such that the shortest path to be computed should not go 
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Fig. 5. Turning polygons P and Q inside out. 



across the line segment connecting them. After the polygon PQ is accordingly 
modified, the shortest path query can safely be performed. Also, the shortest 
path between two points u and v in the exterior of P U Q is the shortest one of 
three paths 7t(m, v), 7t(u, a) U 7r(a, v) and 7t(u, a') U 7r(a', v) in PQ. 

First, all points pij can be obtained from the intersections of dP with edges 
of the diagram GV{P). To compute the interval [Pijjpfj] on dQ, we shoot two 
rays at pij along Li and L2 in polygon PQ. If both of them hit dQ, the interval 
[Pij,Pij] is obtained and we are done. If only one of them, say, the point hit by 
the ray along L2, is on dQ, then we obtain the point pj y To compute the point 
pI j, we modify polygon PQ by inserting the edge connecting pi j with the hit 
point on dQ and its copy, and by removing the old pseudo-edges. Let /2 denote 
the intersection point of dQ with L2, which is farthest to pij along L2. Point 
/2 can be computed by shooting a ray at infinity in the converse direction of L2 
in the exterior of polygon Q. See Figs. 6a-b. Then, find the first turn point of 
the path Tr{pij, f2) in the polygon PQ. If the turn point is on dQ, it is just p] ^ 
(Fig. 6a). Otherwise, shoot a ray at pi^j to the first turn point (on dP), and the 
hit point on dQ gives p] j (Fig. 6b). 




Fig. 6. Illustration for the proof of Lemma 3. 



Consider now the situation where two rays along L\ and L2 hit dP. Let p\ 
and p'l denote two intersection points of dP with Li, closest and farthest to pij, 
respectively. Modify the polygon PQ by inserting the segment (pij,pi) and its 
copy, and by removing the old pseudo-edges. Then, find the first turn point of 
the path Tr{pij,p'i) resulting polygon PQ. If the turn point is on dQ, it is 

an endpoint of the interval [p] j,Pi j]- Otherwise, shoot a ray at pij towards that 
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turn point. If the hit point is on dQ, it gives an endpoint of [Pij,Pi j], say, p] j 
in Fig. 6c; otherwise, the fan F{pij) is empty. If F{pij) is not empty, the other 
endpoint of [Pij,Pi j] can similarly be found by considering two intersections of 
dP with L 2 , one closest to pij and the other farthest to pij. 

Finally, consider the situation where two rays shot at Pij along Li and L 2 
hit nothing. If a point of Q is contained in T(Li, L 2 ), the convex hull CFf{Q) of 
Q is completely contained in T(Li, L 2 ), and thus points p] j, pF can be obtained 
from two tangent points of pij to CFl{Q). Otherwise, the fan F{pij) is empty. 

Since a constant number of operations is needed, the interval [pj j, Pi j] can 
be computed in 0(log^ n) time. Hence, the lemma follows. □ 

Lemma 4 LetGV(pi) he a non-empty farthest-site geodesic Voronoi region, and 
A{pi) the set of anchors of all points on the boundary ofGV(pi) with respect to 
vertex Pi- All fans F{pi{a)), for Pi € P and Pi{a) G A{pi), can he computed in 
O(nlog^n) time. 

Proof. As it can be proved by an argument similar to the proof of Lemma 
3, we omit the detail in this extended abstract. □ 

By now, we can give the main result of this paper using the results obtained 
in the previous sections. 

Theorem 1 An optimal bridge between two simple polygons P and Q can he 
computed in O(nlog^n) time. 

Proof. First, compute the furthest-site geodesic Voronoi diagrams of vertices 
of P and Q, respectively [1]. As shown in Lemmas 3 and 4, all fans of P and Q 
can be computed in 0(nlog^ n) time. 

It follows from the condition (1) of Lemma 2 that if pi and pj are two vertices 
farthest to the point pij in the geodesic metric, then pij is a candidate of the 
points p of optimal bridges. In particular, we call such points Pij the candidate 
points. For convenience, we denote by p^, . . ., p^ and , . . ., p™ the sequences 
of candidate points on dP and dQ, respectively, and w{p^) (w{q^)) the geodesic 
distance from p* (q^) to the vertex farthest to p* in P {q^ in Q). 

To quickly find the candidate bridges satisfying the first three conditions of 
Lemma 2, we construct a hierarchical structure that consists of segment trees, 
range trees and persistent search trees [6,7]. For this purpose, we need a new 
idea to represent the fan of a candidate point. Order all vertices and candidate 
points on dP (dQ), say, by a clockwise scan of the polygon boundary, starting 
at a specified vertex Vp G P {vq G Q) . For a complete ordering, we consider the 
point Vp as two points Vpi and Vpr such that Vpi < p < Vpr, for all points p on 
dP. The treatment for the point Vq is the same. Let p*^, p*^ denote the first 
intersection points of dQ with two bounding half lines of F{jf). The fan F(p*) 
can then be represented by the interval [p*^,p*^j if Vq is not contained in F{p'‘), 
or two intervals [wg/,p*^] and [p*^,Vqr] otherwise. 

Consider now how to compute an optimal bridge (p, q) that is not collinear to 
the first segment of the path 7r(p,p') or TT{q, q'). For each point p* (1 < i < 1), we 
want to find a point q^ such that p* and q^ lie in F{qQ and F{p^), respectively. 
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and d{p'^,q^) + w{q^) is minimized. (In this case, w(p*) + d{p^,q^) + w{q^) is 
also minimized.) First, organize the intervals of all fans F{q^) (1 < q^ < m) 
into a segment tree. An interval is stored in O(logn) nodes, and thus each node 
s of the segment tree has an associated set COV ER{s) of the intervals. For 
each governs), store the candidate points whose corresponding intervals are 
in COV ER{s) into a second-level range tree. Again, a candidate point is stored 
in O(logn) nodes, and each node r of the range tree for COV ER{s) has an 
associated set SU B{r) of candidate points. For each SUB{r), we further compute 
the additively weighted (nearest) Voronoi diagram with the weights w{q^) for 
all candidate points q^ in SUB{r), and organize the diagram into a third-level 
persistent search tree. The resulting data structure is called the segment-range- 
persistent tree. Since it takes 0(n log n) time and space to construct a segment 
or range tree, and O(nlogn) time and 0{n) space to compute a Voronoi diagram 
and a persistent search tree, the segment-range-persistent tree can be constructed 
in O(nlog^n) time and O(nlog^n) space [6,7]. 

Given a query point p* , we find the point q^ which is optimal with respect to p* 
as follows. In the top-level segment tree, find the sets COV ER{s) whose intervals 
contain p*. In the second-level range trees storing the found sets COV ER{s), we 
further find the sets SUB{r) whose candidate points (on dQ) are contained in 
the interval [p*^,p*^] or in two intervals [uq/,p®^] and [p'^'^,Vqr]. In the third-level 
persistent search trees, find the candidate points of Q which are locally nearest 
to p* using the point location algorithm [7]. The point q^ optimal with respect to 
p* is then found from answers of these point location queries. Clearly, the query 
time taken for p* is O(log^n). Since the number of candidate points on dP is 
0(n), the time taken to enumerate all of such candidate bridges is 0(nlog^ n). 

Analogously, the candidate bridges satisfying the conditions (2) and 3 of 
Lemma 2 can be computed in O(nlog^n) time. We omit the detail in this ex- 
tended abstract. 

To find the candidate bridges satisfying the condition (4), we consider the 
bounding half lines of anchor’s fans which touch a vertex of P or Q. Assume that 
a bounding line of the fan P{pi{a)) touches a vertex v of P and that q{pi{a)) 
is the first intersection point of the bounding line with dQ. Let qj denote the 
vertex of Q furthest to q{pi{a)) in the geodesic metric, and qj{a) the anchor of 
q{pi{a)) with respect to qj. If qj{a) and v are in the same side of the line through 
{pi{a) , q{pi{a))) , we should perform the rotating process described in the proof 
of Lemma 2. 

Assume that pi is the intersection point of dP with the segment {pi{a),v), 
and p 2 the endpoint of the edge of P which contains pi and is contained in 
F{pi{a)). Consider the process of rotating the bridge (p,q), with point p con- 
tained in the edge (pi,p 2 ). First, we discuss how to handle the events (places) 
where rotation centers are changed. To find the first place where the center is 
changed, we compute the first turn point of the path tt{v, qj{a)) in the exterior 
of P, and the intersection point of the edge (pi,P 2 ) with the line passing through 
V and the first turn point. (The first turn point of 7r(pi, qj(a)) in the exterior of 
P can be found in logarithmic time [4].) If the intersection point does not exist. 
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the center is kept unchanged in the process of rotating bridge (p, q) , with point p 
contained in edge (pi,p2). If the intersection point is found, the rotation center 
is changed from v to that turn point when p reaches the intersection point. In 
this case, we consider (pi,p2) as two edges, separated by that intersection point. 
This allows us to consider only the situation where the center is kept unchanged 
in the process of rotating bridge (p,q), with point p contained in edge (pi,p2)- 
To find the candidate bridge quickly, we perform a binary search on the edges 
of Q which are rotated, using the distance computed for edge (pi,P2) and the 
chosen edge of dQ. (Given Pi{a), v, qj{a) and a pair of the edges of P and Q, the 
position of p and q on the edges giving the local minimum can be computed in 
constant time.) Then, what is the range of the rotated edges on dQ? Clearly, the 
point q{pi{a)) is the starting point of the rotated range. However, it takes 0 {n) 
time to exactly give the ending point, as it may require to follow a sequence 
of anchors qj{a). To overcome this difficulty, we temporarily consider it as the 
endpoint of GV{qj) towards which the point q moves. (Recall that point qj 
cannot be changed in the rotating process.) We then perform the binary search 
on the edges of Q contained in the range described above. Whenever the points 
p, q giving the local minimum do not exist in two considered edges, they occur 
like the segment {p*, q*) shown in Fig. 3, or the point of dQ hit by the ray shot 
at V towards q in the exterior of Q is not q (i.e., the segment (v, q) intersects with 
dQ), we consider the computed distance as oo. This makes sure that the binary 
search should report a pair of points p and q giving the local minimum, with the 
point q contained in the real range of the rotated edges on dQ. If the reported 
point p is not identical to p2, then {p, q) gives a candidate bridge. Otherwise, we 
need to perform the same procedure for the edge of dP, which is next to (pi,p2) 
and contained in F{pi{a)). For this edge, the starting endpoint of the rotated 
range on dQ is already known. In this way, we can find all candidate bridges 
satisfying the condition (4) in O(nlogn) time. 

Finally, we simply report the one among all found bridges which minimizes 
the maximum distance between P and Q. It completes the proof. □ 
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Abstract. Generating high quality meshes is one of the most important 
steps in many applications such as scientific computing. Sink insertion 
method is one of the mesh quality improvement methods that had been 
proposed recently. However, it is unknown whether this method is com- 
petitive to generate meshes with small number of elements. In this paper, 
we show that, given a two-dimensional polygonal domain with no small 
angles, the sink insertion method generates a well-shaped mesh with 
0(n) triangles, where n is the minimum number of triangles generated 
by any method with the same quality guarantee. 

We also show that the sink insertion method more likely can not guar- 
antee the same result for a three-dimensional domain, while the other 
methods such as Delaunay refinement can achieve. 

Keywords: Mesh generation, Delaunay triangulations, sink insertion, 

computational geometry, algorithms. 



1 Introduction 

Mesh generation is the process of breaking a geometric domain into a collection 
of primitive elements such as triangles in 2D and tetrahedra in 3D. It has plenty 
of applications in scientific computing, computer graphics, computer vision, geo- 
metric information system, and medical imaging. Some applications have a strict 
quality requirement on the underlying meshes used. For example, most of the 
numerical simulations require that the mesh is well-shaped. In addition, some 
numerical simulations methods, for example, the control volume method, prefer 
the mesh to be a Delaunay triangulation. 

Recently, Edelsbrunner et. al [2,3] proposed a new mesh improvement method 
based on sink insertion. This new method guarantees to generate a Delaunay 
mesh with a small radius-edge ratio. Edelsbrunner and Guoy (private communi- 
cation) found that the sink insertion method tends to be more economical when 
we want to add as many points as possible at the same time to refine the mesh 
while maintaining the Delaunay property. It will also be useful in the software 
environment with off-line Delaunay triangulation or parallel meshing. In stead 
of dealing with all the circumcenters as many as the number of bad elements, 
they deal with small number of sinks of these bad elements. From experiments, 
they observe as many as 100 bad tetrahedra sharing the same sink. However, 
unlike Delaunay refinement, it is an open problem whether the sink insertion 
method generates an almost-good Delaunay mesh with 0{n) simplex elements, 
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where n is the minimum number of d-dimensional simplex elements generated 
by any other methods with the same radius-edge ratio quality guarantee. 

In this paper, we show that the sink insertion method guarantees to generate 
a well-shaped mesh with size 0{n) in 2D. For a three-dimensional domain, unlike 
the Delaunay refinement methods, the size optimality is not guaranteed because 
of the existence of slivers in an almost-good Delaunay mesh. We give an example 
that suggests that the sink insertion method may not guarantee to generate a 
mesh with size 0{n) for a three-dimensional domain. 

The rest of the paper is organized as follows. In Section 2, we review the 
sink insertion algorithm proposed by Edelsbrunner et. al. In Section 3, we prove 
that the sink insertion method guarantees to generate a well-shaped mesh with 
size 0{n) for a two-dimensional PLC domain with no small angles. In section 4, 
we discuss why the sink insertion method may not guarantee that the generated 
mesh has size 0{n) for a three-dimensional PLC domain. Section 5 concludes 
the paper with further discussions. 

2 Preliminary 

A simplicial mesh is called almost good if each of its simplex elements has a 
small radius-edge ratio, which is the circumradius divided by the shortest edge 
length of the simplex. Hereafter we use p(r) to denote the radius-edge ratio of 
a simplex r. Several theoretical and practical approaches have been proposed 
for generating almost-good Delaunay meshes. Assume the spatial domain that 
does not have small angles is given in terms of its piecewise linear complex 
boundary (PLC) [7]. It has been shown that the Delaunay refinement methods 
[1,5,6] generate an almost-good Delaunay mesh with size 0(n), where n is the 
minimum number of elements for any mesh with the same radius-edge ratio for 
the same geometric domain. 

2.1 Sink 

Edelsbrunner et al. [2,3] defined the sink of a simplex cr in a Delaunay complex 
by the following recursive approach. 

Definition 1. [sink] In a d-dimensional Delaunay complex, let Ca be the cir- 
cumcenter of a d-simplex a; let M(<j) be the set of d-simplices that share a d — 1 
dimensional face with a. For each simplex t € M{a), let H^. be the half space 
containing r bounded by the d — 1 dimensional face shared by t and a. A point 
z is a sink of a when 

— z is Co- and it is contained in a; or 

— z is a sink of t G M{a) and Co is contained in Hr. 

A simplex containing its own circumcenter is called a sink simplex. Edels- 
brunner et al. [2,3] showed that there is no loop in the definition of sink among 
all d-dimensional simplices by proving that the circumradius of the simplex t 
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containing the sink of a simplex a is not less than the circumradius of a. Notice 
that if the circumcenter of a boundary simplex is not inside the domain, then 
its sink is not defined by the above definition. For this case, we just define its 
circumcenter as its sink. Given a boundary /c-simplex a (k < d) contained in a 
fc-dimensional boundary polyhedron, its sink is defined by considering only that 
boundary polyhedron. 

The min-circumsphere of a fc-simplex r in d-dimensions is the smallest d- 
dimensional sphere that contains all vertices of r on its surface. When k = d, 
the min-circumsphere is also called the circumsphere. A point is said to encroach 
the domain boundary if it is contained inside the min-circumsphere of a boundary 
/c-simplex, where k < d. 

Let T be the set of tetrahedron in a 3-dimensional Delaunay mesh and T = 
TU{too}, where Too represents the outside of the domain, called dummy element. 
Edelsbrunner et al. [2] defined the flow relation ^ C 7~ x T with ti ^ T 2 if 

1. Ti and T 2 share a common triangle v, and 

2. the interior of and the circumcenter c of lie on the different side of the 
plane containing v. 

If Ti ^ T 2 , then Ti is called the predecessor of T 2 ; and T 2 is called the suc- 
cessor of Ti . Here predecessor and successor are only meaningful for a Delaunay 
tetrahedron. The set of descendants of tetrahedron r is defined as 

desc(r) = {r} U descT^^(/r), where desci-^p(r) = desc{p). 

Notice that for a triangle, there is only one successor defined, while there are 
only at most two successors defined for a tetrahedron. A sequence of tetrahedra 
with Ti ^ T 2 . . . ^ T„ is called a flow path from ti to r^, denoted by 7r(ri,r„). 
See the left figure in Figure 1 for an illustration in 2D. 

2.2 Sink Insertion Algorithms 

Sink insertion method, proposed by Edelsbrunner et al. [2,3], inserts the sinks 
of bad d-dimensional simplex elements instead of inserting their circumcenters 
directly. A simplex element is bad if its radius-edge ratio is larger than a con- 
stant g. For the completeness of the presentation, we review the sink insertion 
algorithm for a three-dimensional domain. 

Algorithm: Sink-Insertion(£»o) 

Empty Encroachment: For any encroached boundary segment, add its mid- 
point and update the triangulation. For any encroached boundary triangle, 
add its sink and update the triangulation. If the sink to-be-added encroaches 
any boundary segment, we split that segment instead of adding that sink. 
Bad Elements: For any tetrahedron a with p{a) > go, find its sink Sa. Assume 
that So- is the circumcenter of a tetrahedron t. Insert the sink So to split r and 
update the Delaunay triangulation. However, if So encroaches a boundary 
segment or triangle, we apply the following rules instead of adding So. 
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Equatorial Sphere: For any boundary triangle ^ encroached by the sink Sa^ 
add the sink of /r. Update the triangulation accordingly. However, if 
encroaches any boundary segment, we apply the following rule instead. 
Diametral Sphere: For any boundary segment v encroached by the sink or 
the sink s^, add the midpoint of v. Update the triangulation accordingly. 

Recall that the insertion of the circumcenter of a bad d-simplex will imme- 
diately remove the simplex. Inserting the sink of a bad tetrahedron may seem 
counter-intuitive, because the sink of a d-simplex could be far away from it. 
Consequently, the insertion of the sink may not remove the bad d-simplex im- 
mediately. The termination of the algorithm may be in jeopardy. However, it 
is proved that the circumradius of a tetrahedron a is no more than that of the 
tetrahedron r containing the sink of cr[2,4]. Then the proofs of the termination of 
Delaunay refinement method [6] can be applied directly to prove the termination 
of the sink insertion algorithm. If we select g»o > then the minimum distance 
among mesh vertices after the sink insertion will not decrease, which implies the 
algorithm’s termination. If there are boundary constraints, the constant qq has 
to be increased to -\/2 for 2D domain and 2 for 3D domain. 

3 Good Grading Guarantee in 2D 

This section is devoted to study the number of elements in the generated two- 
dimensional mesh by analyzing the relation between the nearest neighbor func- 
tion N{) defined by the final mesh and the local feature size function Z/s() defined 
by the input domain. Here N{x) is the distance from x to its second nearest mesh 
vertex. A mesh vertex v is always the nearest mesh vertex of itself. Local feature 
size lfs{x) is the radius of the smallest disk centered at x intersecting two non- 
incident input segments or input vertices. Both N{) and lfs{) are 1-Lipschitz 
function. A mesh is said to have a good grading if the nearest neighbor function 
N{) defined on the mesh is within a constant factor of lfs{). 

We study the spacing relations among intermediate meshes by using similar 
idea as Ruppert and Shewchuk did. With each vertex u, we associate an insertion 
edge length e„ equal to the length of the shortest edge connected to v immediately 
after v is introduced into the Delaunay mesh. Here v may not have to be inserted 
into the mesh actually. For the sake of convenience of analyzing, we also define 
a parent vertex p{v) for each vertex v, unless v is an input vertex. Intuitively, 
for any non-input vertex v, p(v) is the vertex “responsible” for the insertion of 
V. We discuss in detail what means by responsible here. If v is inserted as the 
sink of a triangle cr with p{a) > Qq, then p(y) is the most recently inserted end 
point of the shortest edge of a. If all vertices of a are original input vertices, 
then p{v) is one of the end points of the shortest edge of a. If v is the midpoint 
of an encroached segment, then p{v) is the encroaching vertex. For the sake of 
simplicity, we always assume that the encroaching vertex is not an input vertex, 
because Ruppert [5] and Shewchuk [6] showed that the nearest neighbor function 
N 0 defined on the Delaunay mesh after enforcing the domain boundary is within 
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a constant factor of the local feature size function, i.e., N{v) ~ lfs{v). The parent 
vertex p{v) of v does not need to be inserted into the mesh actually. 

We then show that e„ of any introduced mesh vertex v is related to that of 
its parent vertex p(v) . Here v may also not be inserted due to encroaching. For 
a vertex v, as [6,4], we define the density ratio at point v as Dy = Clearly, 

Dy is at most one for an input vertex v, and for newly inserted vertex v, Dy 
tends to become larger. Notice that Dy is defined just immediately after v is 
introduced to the mesh; it is not defined based on the final mesh. 

Lemma 1. [Radius-VARIATION] Consider a triangle a. Let u and e„ be the 
circumcenter and the circumradius of a. Assume that there is no triangle with a 
large radius-edge ratio in desc{a) except possibly a itself. Assume triangle t ^ a 
contains the sink of a inside. Let p. be the predecessor of triangle r. Let w and 
Cyy be the circumcenter and circumradius of the triangle p. Then 

1 ,, 

Sw - Sy > U> - U . 

4po 

Proof. Let’s consider a triangle Ti = pqr and its successor T2 = pqs in desc(cr), 
where T 2 yf r. Triangle T 2 has a small radius-edge ratio from the assumption of 
desc{a). Let x and y be the circumcenter of ti and T 2 respectively. Let Cx and 
6y be the circumradius of t\ and T 2 respectively. The right figure in Figure 1 
illustrates the proof that follows. Let c be the midpoint of the edge pq. Let 




P 




Fig. 1. Left: the descendants of a triangle pqr; Right: the relationship between 
the circumradius Cx of ti = pqr, the circumradius Cy of T 2 = pqs and jja; — j/||. 

a = /tcqx and /3 = Ixqy. Assume that cq has a unit length. Then Cx = 1/ cos(a), 
By = l/cos(a -I- (3), and jja; — yjj = tan(a P) — tan(a). It is easy to show that 

By-ex sin(o;-|-f) B. 

T7^ 17 = 3-^ = sinla P) — cos(a -I- /?) tan( — ). 

\\x-y\\ cos(f) 2 

Assume we fix the value of a P = 9 q, then it is easy to show that 

sin(a-|-§) . fa ^ fa \i. j. 

-j— > sm(6'o) - cos(6»o) tan( — ) = tan( — ). 

cosl^l ^ ^ 
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Let ps' be a diameter of the circumcircle of the triangle pqs. It is easy to show 
that ll? — s|| < ||g — s'll = 2||c— 1 /|| = 2cy sin(a-|-/3). The triangle T 2 = pqs has a 
small radius-edge ratio implies that H? — s|| > Thus we have sin(a-|-/3) > 

It follows that pEffi > tan(^) = > 4 ^. Using the triangle inequality, 

it is easy to show that — e„ > ~ wH by summing up the inequalities 

for all triangle and successor pairs t\ and T 2 in desc{a) with T 2 yf r. ED 



Theorem 1. [Bounded Density Ratio] There are fixed constants Di > 1 
and D 2 > 1 such that for any vertex v inserted or rejected as the sink of a 
bad triangle, Dy < D 2 ; for any vertex v inserted or rejected as the midpoint 
of an encroached boundary segment, Dy < D\. Hence, there is a constant D = 
max{Di, D 2 } such that Dy < D for all mesh vertex v. 



Proof. We prove this by induction. If v is an original input vertex, then the 
length 6y of the shortest edge connected to v is at least Ifs (v) from the definition 
of lfs{v). Thus Dy = < 1 and the theorem holds. 

Then consider non-input vertex v. We first consider inserting v as the sink 
of a triangle a. Let u be the circumcenter of the triangle a. Notice that v is also 
the sink of any triangle from desc{a). Without loss of generality, assume that 
no triangle except cr from desc{a) has a large radius-edge ratio. Assume v is the 
circumcenter of a triangle t. Notice that Cy is equal to the circumradius of r. 

Case 1: the triangles a and r are the same. Notice that a has a radius-edge 
ratio at least qq, then parent p of vertex v is one of the end points of the shortest 
edge pq of a . Here p could be the most recently inserted vertex or an original 
vertex of a. Then q is an original vertex or is inserted before p. In both cases, 
we have Cp < \\p— gjj. Then <\\p — q\\< — . And Cy = Ra- > go ■ Cp. Notice 
that lfs{p) < DpCp, where Dp is the density ratio bound of vertex p derived from 
induction. Thus lfs{u) < lfs{p) -I- jjp — ujj < DpCp + Cy < + l)e„. It implies 

that Dy = 1 . So a sufficient condition for bounding the density 

ratio of vertex u is 

+ ( 1 ) 

go 

Case 2: the triangles cr and r are not the same. Let w be the circumcenter of 
the predecessor triangle of r. Similarly we define Cy, as the circumradius of that 
triangle. Then by previous lemma 1, we know that there is a constant 6 = ^ 
such that Cy, — Cy > 6\\w — u\\. Here w and u could be the same. 

Subcase 2.1: vertices w and v are not close, i.e., jju — icjl > ee„, where e = 
Then similar to the previous lemma 1, e„ — e„ > <5|jf — ujj. For point v, we have 
lfs{v) < lfs{u) + ||u - v|| < lfs{u) + l{ey - Cy) < {1 + ^ - |)e„ -k Thus a 
sufficient condition that Dy = < D 2 is (l+^~y)eii < {D 2 — \)ov From 

e„ < Cy, this inequality is satisfied ifl-k^ — | < D 2 — and D 2 — \ >0. 
From Dp < max(Di, D 2 ), a sufficient condition that Dy < D 2 is 



D 2 > max(-, 1 -I ). 

0 On 



£•0 



( 2 ) 
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Subcase 2.2: vertices w and v are close, i.e., ||u — w|| < ee„, where e = For 
vertex w, similar to subcase 2.1, we have lfs{w) < (1 + ^ — j)e„ + jCu,. Then 
ifs{v) < lfs{w) + ||u — icll implies that lfs{v) < (1 + ^ — j)e„ + |e„ + ee„ < 
(1+ ^ — j)e„ + (j + e)e„. Thus, from e„ < e„, a sufficient condition that 

Dy < D 2 is 1 + ^ — j < D 2 — j — e, and D 2 — ^ — e > 0. Consequently, we 
need 



D 2 > max(l + e + 



max(ZIi,L)2) 

Qo 




(3) 



Case 3: vertex v is the midpoint of a segment that is encroached by a vertex 
w. Here w is the sink of a triangle a with large radius-edge ratio. Assume that 
the sink w is contained in triangle r. Let u be the circumcenter of triangle cr. 

Subcase 3.1: vertices w and u are not same. From the analysis of case 2, we 
have lfs{w) < (1 -k ^ — y)e„ + (| + e)eu, by substituting u by w in the results. 
The vertex w is inside the circumcircle centered at v with radius e„. Therefore 
ew < v^e„. From lfs{v) < Z/s(w)-|- ||w-u|| < (1+ - j)e« -k -k v^e-k l)e^, 

inequality Dy < Di holds ifl-k^ — | < Di — -\/2(y -k e) — 1 and D\ > 
•\/2(j -k e) -k 1. Consequently, a sufficient condition is that 



Di > max{2 -k - + e) 
0 



1 + + 

d go 



(4) 



Subcase 3.2: vertices u and w are the same. We have lfs{v) < (/s(u)-k||u— u|| < 
(1 + -k e„ < (1 -k -k e„. To prove Dy < Di, we need 



^0 



72(1+ ^) + l < Di. 

Qo 



(5) 



As conclusion, if we choose D\ and D 2 as the follows. 



D\ = max 



1 + 72(5 + e), 

(+2-ei)eo , n /I I 1 I I 

eo -+2 ’ and D 2 = max(- + e, 1 + eH ), 

Qo{25+^/2-l + ^/2e5) ° ^*0 

(5(eo-i) ’ 



then all inequalities are satisfied. Notice here 5 = and e = 



□ 



The following theorem concludes that the generated mesh has a good grading, 
i.e., for any mesh vertex v, N{y) is at least some constant factor of lfs{v). 

Theorem 2. [Good Grading] For any mesh vertex v generated by the sink 
insertion method, the edge incident on v has length at least ■ 

The proof is omitted here. The values corresponding to Di and D 2 guaranteed 
by the Delaunay refinement method [5,6] are small: Dy = js at least ^ 
for a vertex v inserted as the circumcenter of a bad triangle and at least — 

£•0 — v2 
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for a vertex v inserted as the midpoint of an encroached segment. For instance, 
Ruppert claims that if the smallest angle is 10 degrees, then no edge is smaller 
than i of the local feature size of either of its endpoints. To guarantee the 
minimum angle 10 degrees, we need qq = ~ 2.88. Then 5 « 0.087 and 

e « 0.174. So Di « 17.54 and Z?2 ~ 16.69. It then implies that no edge is smaller 
than ^ of the local feature size of either of its endpoints in any mesh generated 
by the sink insertion method. Therefore, the theoretical bound on the number 
of elements of the mesh generated by sink insertion method is more likely larger 
than that by the Delaunay refinement method. 

Ruppert shows that the nearest neighbor value N(v) of a mesh vertex v of 
any almost-good mesh is at most a constant factor of lfs{v), where the constant 
depends on the radius-edge ratio. The above Theorem 2 shows that the nearest 
neighbor N{v) for the 2D Delaunay mesh generated by sink insertion is at least 
some constant factor of lfs{v). Then we have the following theorem. 

Theorem 3. [Linear Size] The number of triangles in the 2D mesh generated 
by the sink insertion method is within a constant factor of any Delaunay mesh 
for the same domain, where the constant depends on the radius-edge ratios of 
the meshes. 

4 Discussions for 3D Domain 

Shewchuk [6] showed that the Delaunay refinement method generates almost- 
good meshes with a good grading guarantee in two and three dimensions. We had 
showed that the sink insertion method also generates a almost-good mesh with a 
good grading guarantee in two dimensions. Unfortunately, the proofs presented 
here can not be directly applied to three dimensions. The reason is as follows. To 
guarantee the size optimality of the sink insertion method, the nearest neighbor 
function N{) defined on the generated mesh must be within a constant factor 
of the local feature size function lfs{). Notice that lfs{v) > e„ > N{v), where 
6y is the length of the shortest edge connected to v when vertex v is inserted. 
Therefore, when a vertex v is introduced to the mesh, e„ should be within a 
constant factor of lfs{v) to guarantee the good grading of the generated mesh. 
In other words, we have to prove the existence of a constant D such that for 
each vertex v inserted into the mesh, lfs{v) < Dcy. Or more specifically, there 
should exist three constants Di, D 2 , and D3 such that 

— for each vertex v inserted as the circumcenter of a tetrahedron, lfs{v) < D^e^; 

— for V inserted as the circumcenter of a boundary triangle, lfs{v) < D 2 Cy; 

— for V inserted as the midpoint of a boundary segment, lfs{v) < DiCy. 

Assume that we insert a vertex u as a sink of a tetrahedron cr, and v is the 
circumcenter of a tetrahedron r. Notice that the structure of desc{a) is a DAG 
whose node out-degree is at most 2. Consider a tetrahedron r from desc{a). 
Without loss of generality, we assume that there is no tetrahedron in 7t(ct, t) 
with a large radius-edge ratio except tetrahedron a itself. In three dimensions. 
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the fact that a tetrahedron has a small radius-edge ratio does not guarantee that 
the tetrahedron has no small angles. Slivers are the only tetrahedra that have a 
small radius-edge ratio but their aspect ratio could be arbitrarily large. Let u and 
6u be the circumcenter and the circumradius of a. Let be the circumradius of 
the tetrahedron r. It is possible that e„ is almost the same as e„ even vertex v is 
far away from u. Consequently, could not be bounded from below by any 

constant. Figure 2 gives an example of a configuration such that could 

be arbitrarily small. This, together with the fact that lfs{v) could be as large as 
Z/s(m) -I- ||u — ull implies that Dy = could be much larger than 
Therefore we can not bound the density Dy using the relation between Dy and 
Dy even assuming that the density Dy of vertex u is bounded by a constant D^. 

Figure 2 is constructed as follows. Let sliver poqoroso be a successor of the 
tetrahedron cr, which is not shown in the left figure of Figure 2. Assume that the 
circumcenter of the sliver Poqo^oSo is on the different side of the plane L/pgroso 
passing po, tq and sq with sliver po'Zo^'oSo- One of the successor of sliver pogorgso 
is constructed by lifting vertex go to a new position qi such that tetrahedron 
PogiroSo is a sliver and its circumcenter is on the different side of the plane 
Hpoqiso passing pq, gi and sq with the tetrahedron pogiroSQ. Then we lift the 
node tq to ri to construct a sliver successor poQi^iSo of tetrahedron PogirgSo 
whose circumcenter is on the different side of the plane with po9i?'iSo- 

We then construct a sliver successor p^givisi of pogiriso by lifting sq to si . Sliver 
successor piqirisi is constructed by lifting po to a new position pi. The above 
procedure could be repeated many rounds ifpo<Zo^oSo is carefully configured and 
every lifting is carefully chosen. The middle figure in Figure 2 give the sliver 
pattern used in constructing this example. It is easy to show that using only the 
tetrahedron r also can not bound the density ratio Dy = Assume that p 

is a vertex of the shortest edge of tetrahedron r; see the right figure in Figure 2. 
Then we have Dy <1-1- '^Dp. Here the situation Dy = 1-1- -^Dp could be 




Fig. 2. Left: all sliver descendants; Right: the sink simplex does not help. 



achieved if r is a regular tetrahedron. It implies that the upper bound for Dy 
could be always larger than that for Dp. However, we doubt that these situations 
can really happen in practice. 
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5 Conclusion 

In this paper, we show that the sink insertion method guarantees to generate 
a two-dimensional mesh with good grading. On the other hand, we also give 
an example of three-dimensional local mesh configuration to show that the sink 
insertion method may fail to generate a mesh with size 0{n), where n is the 
minimum number of the mesh elements with the same radius-edge ratio quality. 

As reported by the experimental results (Guoy and Edelsbrunner, private 
communication), the sink insertion method usually generates meshes whose sizes 
are not much larger than that by Delaunay refinement method. However, it is 
interesting to see if we can theoretically prove the good grading guarantee of the 
sink insertion method or give an example of three-dimensional domain such that 
a sequence of sink insertions will generate a mesh whose size is larger than 0{n). 
Notice Li [4] recently gave a new refinement-based algorithm that generates well- 
shaped three-dimensional meshes with size 0{n). Li [4] also proposed a variation 
of the sink insertion method, which inserts a point near the sink and its insertion 
will not introduce small slivers compared to r instead of inserting the sink of a 
tetrahedron t with large radius-edge ratio or sliver. This variation guarantees 
to generate well-shaped 3D Delaunay meshes. However, it is open whether this 
variation will have a good grading guarantee. It is interesting to see what is the 
mesh size relation between two meshes generated by the sink insertion method 
and this variation proposed in [4]. 
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Abstract. In this paper, we present an 0(n® log n) time solution for 
the following multi-label map labeling problem: Given a set S' of n dis- 
tinct sites in the plane, place at each site a triple of uniform squares of 
maximum possible size such that all the squares are axis-parallel and a 
site is on the boundaries of its three labeling squares. We also study the 
problem under the discrete model, i.e., a site must be at the corners of 
its three label squares. We obtain an optimal 0(nlogn) time algorithm 
for the latter problem. 



1 Introduction 

Map labeling is a popular problem on information visualization in our daily life. 
It is an old art in cartography and finds new applications in recent years in GIS, 
graphics and graph drawing [AKS98, CMS93, CMS95, DF92, FW91, Imh75, 
Jon89], 

[KR92, KT97, KT98a, PZC98, Wa94, WW95, ZP99]. Among many problems 
in map labeling, labeling points is of special interest to many practitioners and 
theoreticians. In the paper by Formann and Wagner, any point site can only 
be labeled with 4 candidate (axis-parallel) squares each of which has a vertex 
anchored at the site and the objective is to maximize the square size. See Figure 

I, La, for an instance of the problem. (In general we call this kind of model dis- 
crete, i.e., each site has 0(1) candidates.) Even this seemingly simplest version 
is shown to be NP-complete and moreover; it is NP-hard to approximate within 
factor 2 [FW91]. (For details regarding NP-completeness readers are referred to 
[GJ79].) In the past several years more generalized models have been proposed. 
The basic idea is to allow each site to have an infinite number of possible candi- 
date labels (see [DMMMZ97, IL97, KSW99, SW99, ZP99]). This model is more 
natural than the previous discrete models (like the one in [FW91]) and has been 
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I.a 



Lb 



Il.a 



Il.b 




Ill.a Ill.b 

Fig. 1. Examples for one-label, two-label and three-label point labeling. 



coined as the sliding model in [KSW99]. On the other hand, designing efficient 
algorithms for map labeling under the sliding model is a new challenge to map 
labeling researchers. We briefly review some recent results on labeling points 
under the sliding model. 

Before our review, we briefly define some necessary concepts in approxima- 
tion algorithms. An approximation algorithm for a (maximization) optimization 
problem II provides a performance guarantee of p if for every instance I of 
77, the solution value returned by the approximation algorithm is at least 1/p 
of the optimal value for I. For the simplicity of description, we simply say that 
this is a factor p approximation algorithm for 77. 

In [DMMMZ97], Doddi et al. designed several approximation algorithms for 
labeling points with arbitrarily oriented squares and circles (though the constant 
factors are impractical: 36.6 and 29.86 respectively). They also presented bicri- 
teria PTAS for the problems. The former bound was improved to 12.95 by Zhu 
and Qin [ZQOO] and significantly to 5.09 by Doddi et al. most recently [DMMOO]. 
The latter bound on labeling points with circles was improved to 19.35 by Strijk 
and Wolff [SW99] and recently to 3.6 by Doddi et al. [DMMOO]. If any labeling 
square must be along a fixed direction (e.g., axis-parallel), Zhu and Qin showed 
that it is possible to have a factor-4 approximation [ZQOO]. In [KSW99] van 
Kreveld et al. proved that it is NP-hard to decide whether a set of points can all 
be labeled with axis-parallel unit squares under the sliding model. See Figure 1, 
Lb, for an example of labeling points with sliding axis-parallel squares. (In fact, 
in [KSW99], van Kreveld et al. tried to maximize the number of sites labeled 
instead of the size of the labels.) In [SW99] Strijk and Wolff used similar ideas to 
prove that the problem of labeling points with maximum size uniform circles is 
NP-hard. This explains why it is meaningful to study approximation algorithms 
for these problems. 
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As another kind of generalization for map labeling, recently Zhu and Poon 
studied the problem of labeling point sites with axis-parallel uniform square pairs 
and circle pairs. The motivation is that in some applications we need two labels 
for a site [KT98] (like labeling a map for weather reporting or labeling a bilingual 
city map). They obtained factor-4 and factor-2 algorithms for the two problems 
respectively, besides presenting a bicriteria approximation scheme [ZP99]. For 
map labeling with uniform square pairs, Zhu and Qin [ZQOO] first improved the 
approximation factor of [ZP99] from 4 to 3. Then Qin et al. further improved 
the factor to 2 [QWXZOO]. More recently, Spriggs proved that the problem is 
NP-hard and, in fact, NP-hard to approximate within a factor of 1.33 [SpOO]. 
See Figure 1, II. a and II. b for examples of labeling points with uniform square 
pairs under the discrete and sliding model respectively. 

For map labeling with uniform circle pairs, Qin et al. first improved the 2 
approximation factor to 1.96 and proved that problem is NP-hard, in fact, NP- 
hard to approximate within a constant factor of (5 > 1 [QWXZOO]. The 1.96 
factor was recently improved to 1.686 by Spriggs and Keil [SKOO] and then by 
Wolff et al. to 1.5 [WTXOO]. There are still some gaps between the lower and 
upper bounds for both of the two problems. 

In this paper, we study the problem of labeling point sites with uniform 
square triples. (See Figure 1, III. a, Ill.b for examples of labeling points with 
uniform square triples under the discrete and sliding model respectively.) The 
problem is interesting both in application and theory. In practice, many weather 
reporting programs on TV need to label a city with three labels: its name, 
temperature and chance of rainfall. In theory, the problem of labeling point sites 
with uniform squares is NP-hard under either the discrete model [FW91] or the 
sliding model [KSW99]. Also, the problem of labeling point sites with uniform 
square pairs is NP-hard under both the discrete and sliding model [SpOO] . On the 
other hand, labeling point sites with four squares is trivially polynomial solvable 
(the solution is decided by the closest pair of the point set, under the Loo metric). 
The labeling problems for points with circles and circle pairs are both NP-hard 
[SW99, QWXZOO]. Finally, we remark that it is impossible to label a point with 
three or more non-overlapping circles. 

We now define formally the problems to be studied. We also make some extra 
definitions related to our algorithms. The MLUST problem (Map Labeling with 
Uniform Square Triples) is defined as follows: 

Instance: A set S of points (sites) Pi, P 2 , •••, Pn in the plane. 

Problem: Does there exist a set of n triples of axis-parallel squares of maximum 
size (i.e., length of a side) L* each of which are placed at each input site Pi £ S 
such that no two squares intersect and no site is contained in any square. 

Notice that we can have two versions of the problem: the problem can be 
under the discrete model in which every site must be at the corners of its three 
labeling squares and the problem can also be under the sliding model in which 
every site can be on the boundaries of its three labeling squares. Because of the 
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nature of the problem even under the sliding model every site must be at the 
corners of at least two of its labeling squares. We present below a few extra 
definitions which will be used in later sections. 

Given a set S of n sites in the plane, the closest pair of S under Loo metric, 
Doo{S), is defined as the minimum Loo-distance between any two points in S. 
Clearly Doo{S) can be computed in 0(n log n) time with standard algorithm in 
computational geometry [PS85]. 

2 MLUST under the Discrete Model 

In this section we present the details of a polynomial time algorithm for the 
MLUST problem under the discrete model. Because of the nature of the problem, 
the metric discussed in this paper is Loo unless otherwise specified. Let Doo{S) 
be the closest pair of S under the Loo-metric. Let I* denote the size of each 
square in the optimal solution of the discrete MLUST problem. The following 
lemma is easy to prove. 

Lemma 1. Doo{S)/2 <1* < Doo{S). 

In the following we show how to decide whether a set of points S can be 
labeled with square triples with edge length I, Doo{S)/2 < I < Doo{S). For any 
two points Pi,Pj € S, let doo{Pi,Pj) denote the Loo-distance between them. Let 
Ci denote the Loo-circle centered at point pi G S with radius 1. Clearly, the circle 
Ci contains no other point from the input set S except its own center. Note that 
each circle Ci can be partitioned into four Loo-circles with radius 1/2, which are 
geometrically squares with edge length 1. We loosely call them sub-squares of Ci. 
Basically, to label pi we need to select three out of four sub-squares in each Ci 
so that no two sub-squares intersect each other. 

We now present our algorithm. First, we compute the following multiple 
intersection graph Cm{S,1). Ci (1 < i < n) are the vertices for Gm{S,1). There 
is an edge between Ci,Cj if they intersect and only one pair of sub-squares of 
them overlap. If at least two (and at most three) pairs sub-squares of Ci,Cj 
overlap, then we draw a multiple of two edges between Ci, Cj. (Recall that two 
squares overlap if they have a common interior point.) 

Lemma 2. There is a valid labeling for S with square triples of size I if and 
only if every connected component of Gm{S,1) has at most one cycle. 

Proof: We refer to Figure 2. By the definition of the problem, if we have 
a cycle in Gm{S, 1) which contains no multiple edge then when we label pi the 
labeling will generate ‘pressure’ to either Cj or C^. In other words, one of the 
sub-squares of either Cj or Ck will be ‘destroyed’ and cannot be used as legal 
label for either pj or pk anymore. This holds for all the nodes involved in that 
cycle. (In Figure 2, Case I, the labeling of pi generates ‘pressure’ on Cj.) If 
Ci, Cj form a cycle with two multiple edges, similar claim holds, except that we 
need to label Pi (pj) with a sub-square which only generates one ‘pressure’ to 
Cj (Ci). (See Figure 2, Case 2.) Now we continue with our proof. 
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Ci 





Case 1 



Case 2 



Fig. 2. Illustration for the proof of Lemma 2. 



{Necessity.) Assume that there is a valid labeling for S with square triples of 
size 1. Then for each Ci, it receives at most one ’pressure’, i.e., at most one of its 
four sub-squares cannot be used as legal label for pi. Clearly, that implies that 
every connected component of Gm{S,1) contains at most one cycle. 

(Sufficiency.) It is easy to see that a leaf dangling at a cycle has no influence 
over the labeling of the centers of those nodes involved in a cycle. So we assume 
that there is no leaf node in any connected component of Gm{S, 1). Now if each 
connected component of Gm{S,1) is a cycle then obviously we can label the 
centers of all those nodes Ci involved in that cycle. □ 

Clearly the graph Gm{S, 1) has a vertex degree of at most 12. So the graph is 
of linear size and checking whether the graph contains more than one cycle can 
be done in linear time with standard graph algorithm. To obtain a polynomial 
time solution for MLUST under the discrete model all we need to do is to prove 
that there are only a polynomial number of candidates for 1. Among them, the 
largest will give us the size of the optimal solution. Let the coordinates of pi be 
{x{p^),y{pi)) and let dmin{Pi,Pj)=Toam{\x{pi)-x{pj)\, \y{pt)-y{pj)\}. (Note that 
|^oo(Pi,Pi)=max{|a;(pi) - x{pj)\, \y{pi) - y{pj)\}.) We have the following lemma. 

Lemma 3. The size of the optimal solution for MLUST under the discrete 
model is equal to either doo{pi,Pj) (in which case it is exactly Doo{S)), or 
doo{Pi,Pj) /‘^ or dmin{Pi,Pj); provided that its value is bounded by Doo{S)/2 and 
D^S). 

Lemmas 1, 2 and 3 naturally give us the following algorithm. For each site pi, 
we look at the axis-parallel square C centered at pi with edge length 4Doo(«S'). 
Clearly any point pk out of this square would be at a distance longer than 
2Doo{S), which implies doo{pi,Pk) /2 cannot be the optimal solution value for 
the problem. As any two sites in C must be at least Dao{S) distance away, we 
only need to consider a constant number (24) points in S which are the closest 
to Pi. (Overall, for all pi this can be computed in 0(nlogn) time using standard 
techniques [DLSS93].) For each such point Pj, we simply measure dao{pi,Pj) and 
drain (Pi j Pj ) ■ If C?oo (Pi j Pj ) /^ Or drain (Pi ; Pj ) is OUt of the range [Oqo (S) /2, Dqq (iF)] 
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then throw it away as a valid candidate. Eventually we have at most 2 x 24n+l = 
0(n) number of candidates. We sort them into a list l\, ..., lo(n) £^nd then we run 
a binary search over this list to decide the maximum value I* such that a valid 
labeling for S with such a value, exists. As the decision step, following Lemma 
2, takes 0{n) time, the whole algorithm takes 0(n log n) time. It is easy to 
show that ^2{n log n) is the lower bound for the MLUST problem under either 
the discrete or the sliding model, by a reduction from the element uniqueness 
problem. Summarizing the above results, we have the following theorem. 

Theorem 1. For any given set of n points in the plane, the above algorithm, 
which runs in 0{nlogn) time, produces an optimal solution for the MLUST 
problem under the discrete model. 

3 MLUST under the Sliding Model 

In this section, we shall proceed with the more interesting problem of labeling a 
set of points with sliding square triples. Because of the nature of the problem, 
not all of the three labels for a site pi could slide; in fact, only one of them could. 
We hence call the two discrete sub-squares in the label of pi base sub-squares or 
base labels. Let L* be the optimal solution of the problem MLUST under the 
sliding model. We have a lemma similar to Lemma 1. 

Lemma 4. Dao{S)/2 < L* < Dao{S). 

Our general idea is the same as that for the discrete case. We first try to design 
a decision procedure which can decide for any I G [L>ao(S)/2, L>ao(S)] whether a 
valid labeling of S with sliding square triples of size I exists. We follow the same 
procedure in the previous section to build a multi-graph Gm{S, 1). (The nodes of 
Gm{S, 1) are the set of U whose centers are Pi, for all pi G S.) However, because 
of the difference between the two models we cannot immediately have a lemma 
similar to Lemma 2. The reason is that the sliding of some label for pi might 
simply terminate any ‘pressure’ its neighbor carries over to it. In Figure 3 (a), if 
we label Pi using the shaded sliding label then the ‘pressure’ fromp^ vanishes and 




(a) 




Fig. 3. A pressure-releasing operation. 
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in Figure 3 (b), if we label pi using the shaded sliding label then the ‘pressure’ 
from pj vanishes. We call pj,pi,pk a critical triple if Cj intersects Ci and Ci 
intersects Ck- A pressure-releasing operation on a critical triple pj,pi,pk is that 
we label pi with sliding labels such that the labels generate minimum ‘pressure’ 
on either pj or i.e., the number of either pj or p^s sub-squares destroyed 
by the labels of pi is minimized and furthermore, the area of both pj and p^’s 
sub-squares destroyed by the labels of pi is also minimized. (The first condition 
implies that a pressure-releasing operation destroys either 0 or 1 sub-square of 
Pj or pfc. The second condition implies that if a pressure-releasing operation has 
to destroy a sub-square of pj or pk then it will destroy the minimum area of 
it. Finally, it is clear that we can perform 0(1) number of pressure-releasing 
operations on any given critical triple.) In this case we call Ci a cycle-breaker 
in Gm{S, 1) and clearly a cycle-breaker will terminate some pressure along some 
cycle in Gm(S,1) if we perform a pressure-releasing operation on its center. 
Therefore, we have the following revised version of Lemma 2, whose proof is 
straightforward. 

Lemma 5. There is a valid labeling for S with sliding square triples of size I if 
and only if every connected component of Gm{S, 1) has at most one cycle after 
a set of pressure-releasing operations are enumerated. 

Notice that different from the discrete MLUST problem, this lemma does not 
give us a static algorithm as does Lemma 2. In fact, at the first glance, it seems 
that the above lemma gives us an exponential solution. However, we will make 
use of the specific property of Gm{S,1) to obtain a polynomial time solution. 

What we do is as follows. As the vertex degree of any node in Gm{S, 0 is at 
most a constant (12), we can fix any node Ci and identify all possible states of 
it. That state is determined by the position of the two base sub-squares as well 
as along which path adjacent to Ci the ‘pressure’ will be generated. Clearly we 
have 0(1) number of states for Ci. 

What we do next is to fix a state of Ci and traverse the graph by following 
those paths carrying ‘pressures’ generated so far. Suppose that after visiting Cj 
and successfully labeling pj (i.e., the current state of pj is safe), we reach at 
a vertex Ck. If we can find a valid labeling of pk taking into consideration all 
the pressures generated on pk so far, then we set the state of pk as safe and we 
continue our traversal. If we reach a dead state at pk, i.e., no valid labeling of 
Pk exists (in other words, at least two of pfc’s sub-squares are destroyed), then 
we backtrack to Cj and traverse the edge (Cj,Ck) by starting at a different safe 
state of Pj, if there exists one. If we backtrack to pi and try out all its safe states 
and still cannot find a valid labeling for all the sites in S, then a valid labeling 
of S with square triples of size I does not exist. 

At the first sight, this procedure seems to take exponential time. However, 
the following lemma guarantees a polynomial time solution for fixed 1. (It is 
obvious that if we can label all sites corresponding to nodes in Gm{S,1) with 
square triples of size I then there must exist at least one node in Gm{S, 1) which 
admits a pressure-releasing operation.) 
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Lemma 6. In the above procedure, if at each cycle-breaker Cj we minimize the 
out-going pressure along (Cj,Ck) while withholding the incoming ‘pressure’ then 
for each state of Cj there is a unique state for the next cycle-breaker Cf in the 
same cycle. 

The above lemma basically shows that the procedure in the previous para- 
graph runs in linear time if we start at a Ci which admits a pressure-releasing 
operation. (For each cycle-breaker Cj we have 0(1) state, when traversing the 
graph from Cj to the next cycle-breaker Ck we can only reach exactly one state 
of Ofc. This procedure finishes either after we try all states of Ci without find- 
ing a valid labeling for S or we terminate with a valid labeling of size I for set 
S.) However, as in the optimal labeling not all Cfs admit a pressure-releasing 
operation we need to try the above procedure 0(n) times — starting at every 
possible node in the graph. Therefore, for a fixed I deciding whether we can label 
S with sliding square triples of size I can be done in 0(n x n) = O(n^) time. 
To solve MLUST under the sliding model, we must also make sure that there 
are only a polynomial number of candidates for L* . This is guaranteed with the 
following lemma. 

Lemma 7. The size of the optimal solution for MLUST under the sliding model 
L* is equal to either the optimal solution for MLUST under the discrete model 
or doo{Pi,Pj) / K for some Pi,Pj € S and some K such that 1 < K < n — 2, 
provided that its value is bounded by Doo{S)/2 and Doo{S). 

Proof. It is only necessary to discuss the situation when L* is not equal to 
the optimal solution for MLUST under the discrete model. In this situation, the 
reason that the optimal solution value L* for MLUST under the sliding model 
cannot be increased is that there exists a series of K labels p'i,...,p'p^ touching 
each other and the sum of their sizes is exactly the distance between some sites 
Pi,Pj, i.e., each p'f, contributes some distance to fill the Loo-gap between Pi,Pj 
(Figure 4). We call Pi,Pj an extreme pair. (Note that in Figure 5 the base labels 
for all the sites are not shown.) What remains to show is that each p). contributes 
exactly a distance of L* to fill the Loo-gap between Pi,Pj, which in turn implies 
that we do not need to consider any K which is larger than n — 2. Assume 
to the contrary that this is not the case, i.e., at least one of the sites, say pj., 
would contribute 2L* to fill the Loo-gap between Pi,Pj. However, this implies 
that either Pi,p). or p'f.,Pj would be an extreme pair. □ 

Similar to the previous section, Lemmas 4, 5, 6 and 7 naturally give us the 
following algorithm. For each pair of sites Pi,Pj, we simply measure dao{Pi,Pj)- 
If any of doo{Pi,Pj) / K{1 < K < n — 2) is out of the range [L>oo(*S')/2, Doo(«S')] 
then we throw it away as a valid candidate. With i,j fixed, we have at most 
0(n) number of candidates. In total we have 0{n^) number of candidates for 
L*. (We also need to test all the 0{n) candidates for the discrete problem.) We 
sort them into a list Li, ...,Lo(n^) in O(n^logn) time and then we run a binary 
search over this list to decide the maximum value L* such that a valid labeling 
for S with such a value, exists. As the decision step, following Lemma 6, takes 
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0{'n?) time, the whole algorithm takes 0{n^ log n -b x log n^) = 0{n^ log n) 
time. Summarizing the above results, we have the following theorem. 

Theorem 2. For any given set of n points in the plane, there is an 0(n^ log n) 
time solution for the MLUST problem under the sliding model. 

4 Concluding Remarks 

In this paper, we present polynomial time algorithms for the problem of label- 
ing point sites with uniform square triples, under either the discrete or sliding 
models. This is significantly different from the problem of labeling point sites 
with uniform square pairs, which is NP-hard under both the discrete and sliding 
models. An immediate question is whether we can reduce the gap between the 
fi{n log n) lower bound and the 0{n^ log n) upper bound for the general problem 
(under the sliding model). 
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1 Preliminary 

Given a simple polygon P with n vertices and a starting point s on its boundary, 
the watchman route problem asks for a shortest route in P through s such that 
each point in the interior of the polygon can be seen from at least one point 
along the route. It is known that the watchman route problem can be reduced 
in O(nlogn) time to that of computing the shortest route which visits a set of 
line segments in polygon P. In this paper, we present a simple approximation 
algorithm for computing the shortest route visiting that set of line segments. 
Our algorithm runs in 0{n) time and produces a watchman route of at most 2 
times the length of the shortest watchman route. The best known algorithm for 
computing the shortest watchman through s takes O(n^) time [3]. Our scheme is 
also employed to give a -\/2-approximation solution to the zookeeper’s problem, 
which is a variant of the watchman route problem. 

The known watchman route and zookeeper’s algorithms primarily make use 
of the reflection principle. If a and b are two points on the same side of a line L, 
then the shortest path visiting a, L and b in that order, denoted by S{a,L,b), 
follows the reflection principle, i.e., the incoming angle of path S{a, L, b) with L 
is equal to the outgoing angle of S{a, L, b) with L. To actually compute the path 
S{a,L,b), we first reflect b across L to get its image b', then draw the straight 
segment ab' from a to b' , and Anally fold back the portion of ab' lying in the 
other side of L to obtain the path S{a,L,b). See Fig. la for an example. More 
generally, to And a shortest path from a to b that visits a segment I, denoted by 
S{a,l,b), we first compute the shortest path that passes between the endpoints 
of segment I and connects a to the reflection b' and then fold back the portion 
of the path in the other side of I (Fig. lb). 

Let L{a) denote the point of L nearest to a (Fig. 1). The path consisting of 
the segments aL{a) and L{a)b, denoted by S'{a,L,b), is a good approximation 
of S{a,L,b). Note that a, L{a) and b' form an obtuse-angled triangle. Since the 
sum of lengths of two shorter edges of a right-angled triangle is the maximum 
when two acute angles are 7t/4, we have the following result. 

Lemma 1 For an obtuse-angled triangle, the sum of lengthes of two shorter 
edges is smaller than or equal to ^/2 times the length of the longest edge. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 201-206, 2001. 

© Springer- Verlag Berlin Heidelberg 2001 
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Fig. 1. The reflection principle and its approximation. 

Analogously, let l{a) denote the point of the segment I nearest to a (Fig. lb). 
Then the path consisting of two segments al{a) and l{a)b, denoted by S'{a, I, b), 
is also a -\/ 2 -approximation of the path S{a, I, b). 

2 Approximating the Shortest Watchman Route 

Let P be a polygon with a point s on its boundary. A vertex is reflex if its 
internal angle is greater than tt. Polygon P can be partitioned into two pieces 
by a “cut” that starts at a reflex vertex v and extends either edge incident to v 
until it first hits the boundary. We say a cut is a visibility cut if it produces a 
convex angle (< tt) at u in the piece of P containing s. For a cut C, the piece of 
P containing s, denoted by P{C), is called the essential piece of C. We say cut 
Cj dominates cut Ci if P{Cj) contains P{Ci). A cut is called an essential cut 
if it is not dominated by any other cuts. Clearly, a route is a watchman route 
if and only if it visits all essential cuts. (All essential cuts can be computed in 
0(n log n) time.) 

Let Cl, C2, • • •, Cm the sequence of essential cuts indexed in the clockwise 
order of their left endpoints, starting at s. Also, let s = sq = Sm+i- First, we 
And the point of Ci that is closest to sq in P{Ci)- Let si denote the point found 
on Cl, which is called as the image of sq on Ci. Similarly, we can And the images 
of So on C2, C3 and so on. In the case where Sk is not contained in P{Ck+i), 
the image s^+i of Sq on Ck+i is undefined, and the image Sk+2 of Sq on Ck+2 is 
considered instead. The computation of images of sq is terminated when a new 
image s^+i does not dominate all the cuts before it (i.e., Ci, . . . , Ci). Next, we 
take Si as a new starting point and compute the images of Si on the cuts after 
it. This procedure is repeatedly performed, until the image on Cm is computed 
(or considered). In particular, we call the images, which are considered as the 
starting points, the critical images. See Fig. 2 for an example where images sq 
(= se), S2, S4 and S5 are critical. (Image Sm+i is not shown in our figure.) 

Let R' denote the route which is a concatenation of the shortest paths be- 
tween every pair of adjacent critical images Si and Sj ( 0 < i < j < m + 1 ). 
Clearly, R' is a watchman route, which gives our approximation solution. Let 
R denote the shortest watchman route through s. An example of routes R' and 
R is shown in Fig. 2 . In the following, we denote by \xy\ the length of a line 
segment xy, and jZj the length of a route Z. 
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Fig. 2. Critical images and routes R' , R. 



A watchman route R is said to make a reflection contact with cut C ii R 
comes into C at some point and then reflects on C and goes away from that 
point . 

Lemma 2 If route R reflects on cut Ci and the critical image Si is defined, then 
Si is to the left of the reflection point of R on Ci (as viewed from sgj- 

Proof. Omitted in this extended abstract (see also Fig. 2 for an example). □ 

Theorem 1 For any instance of the watchman route problem, |i?'| < 2|i?|. 
Moreover, the watchman route R' can he found in 0(n) time, provided that the 
set of essential cuts is given. 

Proof. Assume first that route R' does not touch the boundary of the given 
polygon P (except at s). Let Si and Sj {1 < i < j < m) denote the first two 
critical images (excluding sq ). If Sj is contained in PiCf), it then follows from 
Lemma 2 that two routes R and R' intersect before Sj. Let t be the intersection 
point immediately before Sj (along the route R'), and R'^t I^e parts of 
routes R, R' from sq to t, respectively. Let t' denote the point obtained by 
reflecting t across Q. Since the chain of segments and sfi' gives the unfolded 
route R'q we have that |sot'| < |f?o,t|- On the other hand, since three points so> 
Si and t' form an obtuse-angled triangle, it follows from Lemma 1 that |i?o J < 
v^lsof'l, and thus \R[, t\ < 

Consider the situation where Sj is not contained in P{Ci). Let t be the first 
intersection of route R' with R, immediately after sj. (See also Fig. 2 for an 
example where Si = S2 and Sj = 54 .) Since the next critical image is contained 
in P{Cj) (otherwise, Sj were not critical), point t is contained in P{Cj), too. 
Let t' denote the point obtained by reflecting t across Cj. Since Si, sj and t' 
form an obtuse-angled triangle, |i?' J < ^/2\sit'\. Let R'^ ^, denote a new route 
that consists of and sfi' . Then we have that |i?o (/| < V^\sot''\ and |so^"| < 
■\/2|i?o,t|, where t" is the point obtained by reflecting t' across Ci if t' is contained 
in P{Ci) (see also Fig. 2), or t" = t' otherwise. Since \R'ot,\ = |soSi| -I- |si<'| > 

|soSi| -b > ^(|soSi| -b |i?',t)| = we have that < 2\Rofi. 
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The rest parts of R and R' can inductively be considered by taking t as & new 
starting point. Note that if the rest part of R' , say, R't^rn+n contains only one 
critical image (excluding Sm+i), then \R[ I < V2\Rt,m+l\- 

Let us now remove the assumption that R' does not touch the boundary of 
polygon P. Consider first the case where Si is the left endpoint of Q. If the 
incoming angle of R' with Ci is larger than the outgoing angle (Fig. 3a), the 
first part of the theorem is still true. Otherwise, it follows from Lemma 2 that Si 
should be visited by the route R (Fig. 3b). Then, the first part of the theorem 
can inductively be proved for two parts of R and R' separated by Sj (i.e., one 
starts at sq and ends at Si, and the other starts at Si and ends at Sm+i)- The 
other case having to be dealt with is that the shortest path between two critical 
images, say, Si and Si+i, wraps around some reflex vertices of P (i.e., the path 
makes left turns at these vertices when one follows it in the clockwise direction) . 
See Fig. 3c. Let r and r' denote the last and penultimate reflex vertices of P 
touched by the shortest path between Si and s^+i. Note that r' is the first point 
of the shortest path between Si and Si+i, which is visible to some portion of 
Ci+\. It follows from Lemma 2 that R has to touch r' and/or r. If r is visited 
by R, the first part of the theorem can inductively be proved for two parts of 
R' and R, separated by the vertex r. In the situation where only r' is visited 
by R, the similar treatment can be done for the parts of R and R' from sq to 
r' . Let Rr'^m+i and R'^., denote the parts of R and R' from r' to Sm+i, 
respectively. Since r is a reflex vertex and Si+i is invisible to r' , we extend the 
segment rsf+I until the foot of a perpendicular to the extension and through 
r' is reached (Fig. 3d). Let R", denote the enlarged route. It is easy to see 
that rn+i\ < l-^r' m+il — 2|i?r',m+i|- K Completes the proof of the first part 
of the theorem. 

Finally, the watchman route R' can be computed in 0(n). Due to space limit, 
we omit the detail. □ 




Fig. 3. Illustration for the proof of Theorem 1. 
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3 Approximating the Shortest Zookeeper’s Route 

The zookeeper’s problem is defined as follows: Given a starting point s and a 
simple polygon {zoo) P with a set P of disjoint convex polygons (cages) inside 
it, each sharing one edge with P, we want to find the shortest route R through 
s that visits (without entering) at least one point of each cage in V. The best 
algorithm for computing the shortest zookeeper’s route takes O(nlog^n) time 
[1]. Jonsson also gave a 6-approximation solution to the zookeeper’s problem [2]. 

Let P\,. . . , Pm denote the cages indexed in a clockwise scan of the boundary 
of zoo P, starting at s. Also we find a point (the image) on the boundary of 
P\ that is closest to Sq in the interior of zoo P (i.e. P — V), a point S 2 on 
the boundary of P 2 that is closest to si, and so on. Putting the shortest paths 
between Si and s^+i for alH, 0 < z < m, gives a zookeeper’s route R' . 

Theorem 2 For any instance of the zookeeper’s problem, |i?'| < -\/2|A|. More- 
over, the zookeeper’s route R' can he found in 0(n) time. 

Proof. Our proof is done by constructing a route R" such that |i?'| < ^/2\R"\ 
and \R"\ < |i?|. Let be the edge of cage Pi containing the image s,. Project 
all vertices of Pi onto the line containing e^, and then extend edge Ci in the 
interior of zoo P rightward (as viewed from s) until the extreme of the projected 
vertices is reached (or as longer as possible otherwise) . See Fig. 4a for an example. 
Let E denote the set of the line segments, which are obtained by extending all 
image-defined edges, and let R" be the shortest route computed by the unfolded 
method, using the line segments in if as mirrors. (Route R" may not be a 
zookeeper’s route.) Since R' can also be considered as the route reflecting on all 
elements of E, it follows the proof of Theorem 1 that |i?'| < -\/2|R"|- 




Fig. 4. Illustration for the proof of Theorem 2. 



Assume that a and b are two points on the same side of the edge e of a 
convex polygon Q with a.x < b.x and that the foot of a perpendicular through a 
is contained in e (Fig. 4b). Let e' denote the line segment obtained by extending 
edge e rightward until the extreme of the projected vertices of Q is reached, 
and S{a,e',b) the shortest path visiting a, e' and b in that order. Let S{a,Q,b) 
denote the shortest path that visits a, some point on the boundary of Q, and 
b in that order. Let c denote the first intersection of S{a, Q, b) with e'. Clearly, 
|S'(a, e',&)| < |ac| -I- |c&| < |S'(o, Q,&)| (Fig. 4b). It implies that \R"\ < |i?|. 

Again, the time taken to compute the route R' is 0(n). □ 
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Abstract. A data structure called PC-tree is introduced as a generalization of 
Pg-trees. PC-trees were originally introduced in a planarity test of Shih and 
Hsu where they represent partial embeddings of planar graphs. Pg-trees were 
invented by Booth and Lueker to test the consecutive ones property in matrices. 
The original implementation of the Pg-tree algorithms by Booth and Lueker 
using nine templates in each bottom-up iteration is rather complicated. Also the 
complexity analysis is rather intricate. We give a very simple PC-tree algorithm 
with the following advantages: (1) it does not use any template; (2) it does all 
necessary operations at each iteration in one batch and does not involve the 
cumbersome bottom-up operation. PC-trees can be used naturally to test the 
circular ones property in matrices. And the induced Pg-tree algorithm can 
considerably simplify Booth and Lueker’ s modification of Lempel, Even and 
Cederbaum’s planarity test. 



I. Introduction 

A data structure called PC-tree is considered here as a generalization of Pg-trees. PC- 
trees were originally introduced to represent partial embeddings of planar graphs in 
Shih and Hsu [7]. Pg-trees were used to test the consecutive ones property in 
matrices [1]. However, the implementation of Pg-tree algorithms by Booth and 
Lueker [1] (hereafter, referred to as the B&L algorithm) is rather complicated. Also 
the complexity analysis is rather intricate. We shall present a very simple PC-tree 
algorithm without using any template. Furthermore, we shall illustrate how to test the 
circular ones property in matrices using PC-trees. 

Pg-trees were invented for the more general purpose of representing all 
permutations of a set U that are consistent with constraints of consecutiveness given 
by a collection C of subsets of U with the convention that the element of each subset S 
in C must occur consecutively in the permutation. 

The literature on problems related to Pg-trees is quite extensive. Pg-trees can be 
applied to test the consecutive ones property of (0,l)-matrices [1,3], to recognize 
interval graphs [1,2] and to recognize planar graphs efficiently [1,6]. Korte and 
Mohring [5] considered a modified Pg-tree and a simpler incremental update of the 
tree for the purpose of recognizing interval graphs. Klein and Reif [4] constructed 
efficient parallel algorithms for manipulating Pg-trees. On the other hand, PC-trees 
were initially used in Shih and Hsu [7] (S&H) to represent partial embeddings of 
planar graphs. Note that this approach is entirely different from Lempel, Even and 
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Cederbaum’s {LEC) approach [6] of using Pg-trees to test the consecutive ones 
property of all nodes adjacent to the incoming node in their vertex addition algorithm. 
In S&H, a P-node is a regular node of the graph, a C-node represents a biconnected 
component and nodes adjacent to the node in consideration can be scattered anywhere 
around the PC-tree. However, in LEC, the leaves of the Pg-tree must be those nodes 
adjacent to the incoming node. 

In this paper we shall focus on the application of Pg-trees to (0,l)-matrices. A 
(0,l)-matrix Mhas the consecutive ones property {COP) for the rows iff its columns 
can be permuted so that the ones in each row are consecutive. M is said to have the 
circular ones property {CROP) for the rows if either the ones or the zeros of each row 
are consecutive. When the zeros of a row are consecutive the ones will wrap around 
the first and the last column. 

A Pg-tree is a rooted tree T with two types of internal nodes: P and g, which will 
be represented by circles and rectangles, respectively. The leaves of T correspond 1-1 
with the columns of M. 

" 1 2 3 45 6 " 

1 1 0000 
011000 
111100 
000111 
^0000 1 1 ^ 

Fig. 1. A Pg-tree and the consecutive ones property 

We shall follow the notations used in [1]. The frontier of a Pg-tree T, denoted by 
FRONTIER{T), is the permutation of the columns obtained by ordering the leaves of T 
from left to right. Such a permutation is called a consistent permutation. The B&L 
algorithm considers each row as a constraint and adds them one at a time. Each time a 
new row is added, the algorithm fries to modify the current Pg-tree to satisfy the 
consecutiveness constraint of columns of the newly added row. To guarantee unique 
Pg-tree representations we need to restrict ourselves to proper Pg-trees defined in 
[1]: every P-node has at least two children; every g-node has at least three children. 
Two Pg-trees are equivalent iff one can be transformed into the other by applying 
zero or more equivalent transformations. The equivalence of two trees is written T ~ 
T . There are two types of equivalent transformations: 

1 . Arbitrarily permute the children of a P-node, 

2. Reverse the order of the children of a g-node. 

Denote the set of all consistent permutations (or frontiers) of a Pg-tree T by 
CONSISTENT {T) = {FRONTIER {T')\T' ~T}. Given a Pg-tree T and a row u of 
M, define a new free called the u-reduction of T to be one whose consistent 
permutations are exactly the original permutations in which the leaves in u occur 
consecutively. This new free is denoted by REDUCE{T,u). Booth and Lueker gave a 
procedure REDUCE to obtain the M-reduction. The procedure applies a sequence of 
templates to the nodes of T. Each template has a pattern and a replacement. If a node 
matches the template’s pattern, the pattern is replaced within the free by the 
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template’s replacement. This is a bottom-up strategy that examines the tree node-by- 
node obeying the child-before-parent discipline. 

A node is said to be full if all of its descendants are in u\ It is empty if none of its 
descendants are in m; If some but not all of the descendants are in u, it is said to be 
partial. Nodes are said to be pertinent if they are either full or partial. The pertinent 
subtree of T with respect to u, denoted by PERT1NENT(T, u), is the subtree of 
minimum height whose frontier contains all columns in u. The root of the pertinent 
subtree is denoted by ROOT(T, u), which is usually not the root of the entire tree. 
There are nine templates used in [1]. 

In order to achieve optimal efficiency, the algorithm takes several precautions in 
scanning the pertinent subtree. For example, the maintenance of parent pointers may 
cause a problem. It is possible that almost all nodes in the tree may receive a new 
parent even though the row u has only two elements. Therefore, parent pointers are 
only kept for children of P-nodes and for endmost children of g-nodes. Interior 
children of a g-node do not keep parent pointers. Rather, they borrow them from their 
endmost siblings. These measures unavoidably complicate the implementation of the 
B&L algorithm. 

A PC-free is a rooted tree T with two types of internal nodes: P and C, which will 
be represented by circles and double circles, respectively. The leaves of T correspond 
1-1 with the columns of M. Figure 2 gives an example of a matrix satisfying the 
CROP (but not the COP) and its corresponding PC-tree. In obtaining consistent 
permutations, the children of a P-node can be permuted arbitrarily whereas the 
children of a C-node observe a circular list, whose order can only be changed from 
clockwise to counter-clockwise. The FRONTIER(T) of a PC-free consists of those 
consistent circular permutations. 

" 123456 " 

1 10000 
01 1000 
111100 
. 0001 1 1 > 

Fig. 2. A PC-tree and the circular ones property 

For each Pg-tree, we can obtain its corresponding PC-tree by replacing the Q- 
nodes with C-nodes. Note that flipping a g-node in a Pg-free is the same as changing 
the circular children list of its corresponding C-node from clockwise to counter- 
clockwise order. There are a few differences between PC-trees and Pg-frees: (1) 
When a Pg-tree is transformed into a PC-tree, and its root is a g-node, then its two 
endmost children must be preserved in order to maintain the COP in any consistent 
circular permutation of the transformed PC-free. Aside from this restriction, all other 
PC-tree operations will yield equivalent Pg-tree operations. (2) Since it is the circular 
order of the leaves that needs to be preserved (for example, in Figure 2, we can rotate 
the tree so that FRONTIER becomes 2, 3,4, 5, 6,1), one might as well fix the first 
column to be 1 in a PC-tree in considering any consistent circular permutation. (3) 
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The root of a PC-tree in general is not essential except to maintain the child-parent 
relationships. 

With the additional freedom of rotation, it is natural for PC-trees to test the CROP. 
Since our purpose is to illustrate how to replace the complicated bottom-up template 
matching strategy in B&L algorithm, we shall concentrate on testing the COP using 
PC-trees. 

Our improvement over the B&L algorithm is based on a simple observation that 
there exist at most two special partial nodes and the unique path connecting them 
gives a streamline view of the update operation. In particular, no template is necessary 
in our PC-tree algorithm and the node-by-node examination is replaced by one swift 
batch operation. 



2, A Forbidden Structure of Consistent Permutations 

A key observation that simplifies our PC-tree algorithm is based on the following 
forbidden structure of consistent permutations. 

Theorem 1. The following structure { 5 i, S2, S3} is forbidden in any consistent 
permutation of the columns of M that preserves the COP. 

Let Si = {ai,bi}, S2 = {02,^2} and S3 = {«3,^>3} be three subsets of columns such that 

( 1 ) Columns «i, «2 and 03 are distinct from each other. 

( 2 ) None of the a/s is the same as any of the bi’s (but bi, ^2 and ^3 are not 
necessarily distinct). 

( 3 ) No two columns in any Si are separated by any column in the other two subsets. 
Furthermore, bj, ^2 and 63 are not separated by any of the «,’s. 

Proof. If bi = ^2, then from ( 3 ), we must have the arrangement ai...b](= b2)---«2--- 
But then, it would be impossible to place ^3 anywhere in the arrangement unless bi = 
^2 = ^3. In the latter case it would be impossible to place 03 anywhere in the 
arrangement. 

Hence, assume bi ^2. From ( 3 ), we must have the following arrangement 
a\...b\...b2...a2... (or the reverse) in any consistent permutation of the columns. 
Since b\, b2 and Z>3 must not be separated by the a,’s, we shall have 
a\...b\...b3...b2...a2... Note that in the above arrangement, ^3 is allowed to be the 
same as either b i or b2. But then, it would be impossible to place 03 anywhere in the 
arrangement. 



3, Terminal Nodes and Paths 

Define a partial node to be a terminal node if none of its children is partial. In other 
words, each child of a terminal node is either empty or full. Terminal nodes play a 
major role in simplifying our tree -update operation. 

Theorem 2. If the given matrix M satisfies the COP, then there can be at most two 
terminal nodes in T at every iteration. 




PC-Trees vs . Pg-Trees 2 1 1 



Proof. Suppose, on the contrary, there are three terminal nodes t\, ?2, and Choose 
an empty child a, and a full child hi from each of the t„ i = 1 , 2 , 3 . Then, 5 i = 

S2 = {02,^2} and 5 s = {03, ^>3} constitute a forbidden structure as described in Theorem 

1 . 



If there are two terminal nodes ti and t2, define the unique path connecting ti and ?2 
to be the terminal path. In case there is only one terminal node t, the terminal path is 
defined to be the unique path from t to ROOT{T,u). In the latter case, t is called a 
special terminal node if it is a C-node and its two endmost children are empty. That 
is, its full children are consecutively arranged in the middle. 

Now, any C-node in the interior of the terminal path has its children (not on the 
path) divided into two sides. Since one can flip the children ordering along the 
terminal path to obtain an equivalent PC-tree, we shall show that there is a unique 
way to flip the children of these C-nodes “correctly”. Denote ROOT(T,u) by m. 

Theorem 3 . If M satisfies the COP and there are two terminal nodes ?i and ?2, then 

( 1 ) Every C-node in the interior of the terminal path satisfies that its full children 
together with its two neighbors on the path are consecutive in its circular list. 
Therefore, they can be flipped correctly. 

( 2 ) Node m must be on the terminal path. 

( 3 ) Let w be a child of a node (other than t) on the unique path from m to the root of T 
such that w itself is not on this path. Then w must be empty. 

Proof. Consider the following cases: 

( 1 ) If a C-node w does not have any full child, we are done. Hence, assume w 
has a full child d. Let the two neighbors of w in the terminal path be and S2- 
Suppose, in traversing the circular list of w from to ^2 through the full child di, we 
encounter an empty child J2, say in the order ^i, di, d2, S2- Let d\, J2’ be two leaves of 
di, d2, respectively. Choose an empty child a, and a full child bi from each of the t„ i = 
1,2. 

Then S2 = {ai,bi}, S2 = {02,^2} and ^3 = {di\d2"} constitute a forbidden structure 
as described in Theorem 1 . 

( 2 ) This is obvious. 

( 3 ) Suppose w has a leaf b in u. Since we have ...a\...b\...b2...a2...{ox its 
reverse) in any consistent permutation of the columns, it would be impossible to place 
b anywhere in the arrangement. 

Theorem 4 . Consider the case that M satisfies the COP and there is only one special 
terminal node t. Let w be a child of a node (other than m) on the path from m to the 
root of T such that w itself is not on the path. Then w must be empty. 

Proof. Suppose w has a leaf b in u. Let a\, «2 be any leaf of the two endmost children 
of t, respectively and b\ a leaf in a full child of t. Then we shall have a\...b\..a2 (or 
its reverse) in any consistent permutation of the columns. But then, it would be 
impossible to place b anywhere in the arrangement. 

By considering the terminal nodes we have a global view of the distribution of all 
full nodes by Theorems 3 and 4 . Figure 3 illustrates an example with two terminal 
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nodes. Note that, in this example, we have flipped all full children of nodes on the 
terminal path down. 



u 



Fig. 3. The unique separating path between two terminal nodes u and u ’ 

4, Constructing the New PC-Tree 

Rather than using the node-by-node tree modification in the B&L algorithm, we shall 
update the current PC-tree to the new one in a batch fashion. 

When a new row u is added, perform the following labeling operation: 

1 . label all leaf columns in u full. 

2. the first time a node becomes partial or full, report this to its parent. 

3. the first time a node x gets a partial or full child label x partial 

4. the first time all children of a node x become full label x full 

Our batch tree -update operation consists of the following steps: 

1 . delete all edges on the terminal path 

2. duplicate the nodes on the terminal path (this is called the splitting operation) 

3. create a new C-node w and connect w to all duplicated nodes according to their 
relative positions on the terminal path as follows (this is called the modifying 
operation)', connect w directly to all P-nodes; connect w to all full children of C- 
nodes according to their original ordering; contract all degree two nodes 

The following figures illustrate the splitting and the modifying operation on the 
graph of figure 3. Note that the root in a PC-tree does not play a major role in these 
operations. 





Fig. 4. The splitting operation 
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Fig. 5. Connecting to the new C-node 




Fig. 6. The modifying operation 



Theorem 5. The corresponding Pg-tree of the newly constructed PC-tree is the same as the 
one produced by the B&L algorithm. 

Proof. As noted before, the B&L algorithm modifies the tree in a node-by-node 
bottom-up fashion based on 9 templates. Whenever a pattern is matched, it is 
substituted by the replacement. 

We shall prove the theorem by induction on the depth of the pertinent subtree. For 
each template operation of the Pg-tree, we shall illustrate the corresponding splitting 
operation of the PC-tree and show the equivalence of the two operations in terms of 
the resulting Pg-tree. We shall skip the easy cases ofPO and PI, gO and gl. 

=> 

Template PI for ROOT (TS) when it is a P-node 

? ? 

, 0 ^' O'’ 

^ ^ n 

Template P3 for a singly partial P-node which is not ROOT {T,S) 




Template P4 for ROOT(T,S) when it is a P-node with one partial child {T,S) 



? ? 




Template P5 for a singly partial P-node, other than ROOT{T,S), with one partial child 



Fig. 7. The template operations of Booth and Lurker’s algorithm 
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Template P6 for ROOT(T,S) when it is a doubly partial P-node 




Template Q2 for a singly partial 2-uode 




Template Q3 for a doubly partial Q-node 



Fig. 7. (Continued) 

First, consider templates at the root of T. In Figure 8 we consider template P2. The 
top part of Figure shows the PQ-tree replacement. The bottom part gives the 
corresponding PC-tree splitting and modifying operation. The equivalence is shown 
by the two rightmost diagrams, in which a PQ-tree (on top) and its corresponding PC- 
tree (in the bottom) are obtained through B&L template matching and the PC-tree 
operation, respectively. 




Fig. 8. Template P2 for ROOT (T,S) when it is a P-node when it is a P-node 



The same goes for the templates P4, P6, Q2 and Q3 as shown in Figures 
9,10,11,12. 




Fig. 9. Template P4 for ROOT{T,S) when it is a P-node with one partial child (TS) 
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Now, assume the theorem is true for Pg-trees whose pertinent subtree is of depth k 
and consider a tree whose pertinent subtree is of depth k+\. In the Pg-tree operation, 
a template will be applied to a non-root terminal node x of the current Pg-tree Pi, to 
obtain a new Pg-tree P 2 . To simplify the argument we assume the pertinent subtree of 
P 2 has depth k. Since, by induction, further Pg-tree operations on P 2 can be done 
equivalently through the corresponding PC-tree operations, we only have to show that 
the template matching operation on node x results in the same subtree as the one 
obtained by operation on the corresponding PC-tree of P 2 . These are illustrated in 
Figures 13 and 14. 





Fig. 10. Template F6 for ROOT{T,S) when it is a doubly partial P-node 
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Fig. 13. Template P3 for a singly partial P-node which is not ROOT (T,S) 




Fig. 14. Template P5 for a singly partial P-node, other than ROOT{T,S), with one partial child 



End of Proof of Theorem 5 



5. The Complexity Analysis 

In the B&L algorithm, the bottom-up propagation potentially could examine the same 
node many times. Hence, special care must be taken to ensure efficiency. In our 
splitting and modifying operation of PC-trees the update is done in one batch. 

We shall apply the same pointer strategy as in the B&L algorithm with regard to P- 
nodes and C-nodes. Our major saving is in skipping the pattern-matching phase at 
each node. The remaining replacement phase will be the same as in the B&L 
algorithm except that we use C-nodes instead of (?-nodes. 

The detailed operations needed for PC-trees are all included in those of the B&L 
algorithm. Hence, the running time is linear in the size of the input. 



6. Conclusion 

We believe that a more efficient implementation of our PC-tree algorithm deviating 
completely from the pointer operation of B&L algorithm is possible. However, the 
purpose of this paper is to demonstrate that, conceptually, there is a simpler way to 
view the Pg-trees and their updates. Implementation issues of PC-trees will be left 
for those who have the experience of implementing the B&L algorithm. 
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Abstract. We investigate the relative efficiency of a finite number of 
stacks in comparison to several variants of deques. In the nondetermin- 
istic setting, two stacks can simulate a general deque in linear time. This 
implies a negative answer to the question raised by Brandenburg whether 
a deque can simulate a finite number of tapes in linear time. We also show 
that in realtime an output-restricted deque cannot simulate two stacks 
for deterministic computations. It is known that a general deque can be 
simulated deterministically by three stacks in linear time. We describe 
an approach that is simpler to analyze and has a smaller constant factor 
(with respect to the required stack operations) than a previous solution. 



1 Introduction 

Functional programming with LISP-like data raises the question whether fa- 
miliar data structures like queues and deques (double-ended queues) have effi- 
cient implementations in the available ‘one-ended’ lists or stacks. By efficient 
we mean that the simulator operates in (cumulative) linear time or realtime, 
i.e., the number of internal operations for each simulated step is bounded by 
a constant. Notice that the latter is stronger, since it rules out techniques like 
garbage-collection. On an abstract level, it has been known for a long time 
that sequential storage with multiple access-pointers — and hence queues and 
deques — can be simulated by sequential storages with single pointers in linear 
time [17] and even realtime [5,11]. Since each linear storage can be split into two 
stacks, this shows the somewhat surprising fact that (up to constant factors) 
no speed is lost if queues and deques are implemented in functional languages. 
Concrete implementations of queues have been given by Hood and Melville [8], 
while Chuang and Goldberg extended this to deques [4]. The known realtime 
simulations however require a large number of stacks per simulated queue or 
deque, while there is an obvious way to do a linear time simulation of a single 
queue by two stacks. It is also known that a deque can be simulated in linear 
time by three stacks [16]. Questions that arise are: What is the minimum number 
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of stacks sufficient for linear or realtime simulation of queue and deque? Does 
the simulation of a deque become easier if the access to data is restricted? 

In the area of formal languages, Ayers [1] investigated automata with a single 
deque that work in realtime (there called quasi-realtime). In his comments on [I], 
Brandenburg [3] stated several open problems. He asked whether the hierarchy 
of language-families accepted by nondeterministic machines working in realtime 
(or linear time) with a queue, an output-restricted deque, a deque, or a finite 
number of tapes is proper. These questions have partially been answered. Li e.a. 
[13] showed that in general even a deterministic machine with a single stack 
(which obviously can be simulated by an output-restricted deque) cannot be 
simulated in linear time by a nondeterministic machine with a single queue. An 
output-restricted deque can be simulated by two stacks in linear time, which by 
the nondeterministic realtime simulation of many tapes by three stacks [2] and 
the lower bound of [12, Corollary 3] shows that an output-restricted deque is 
weaker than a finite number of tapes. 

The results presented here concentrate on a small number of stacks and the 
relation to variants of the deque. 

2 Preliminaries 

We adopt the terminology from [10]. In our investigations, stacks, queues, and 
deques are linear lists of atomic items or symbols. Information can be stored and 
retrieved (along with each deletion) at the ends of the list. In addition, emptiness 
of the storage can be sensed. More precisely we distinguish between: 

Stack: All insertions (push operations) and deletions (pop operations) are made 
at one end. 

Queue: All insertions are made at one end, all deletions at the other end. 
Output-restricted deque: Insertions can be made at both ends of the list, 
while deletions are allowed at one end only. 

Input-restricted deque: Deletions can be made at both ends of the list, while 
insertions are allowed at one end only. 

Deque: Insertions and deletions can be made at both ends of the list. 

Note that in comparison to [10] we exclude any access to the interior of the 
lists 

The usual framework in which these data storages are compared is by attach- 
ing them to a finite control that has a separate read-only one-way input-tape 
and either accepts or rejects an input, see e.g. [15,13]. 

3 Nondeterministic Simulation of a Deque by Two Stacks 

We first consider nondeterministic machines and show that — up to a linear time 
overhead — machines with a single deque are equivalent in accepting power to 
those equipped with two stacks. 

^ What is called stack here is also known as a pushdown-store. The name ‘stack’ is 
often used for a storage that admits read access to its interior, which we don’t allow. 




220 Holger Petersen 



Theorem 1. Every t{n) time-hounded nondeterministic deque machine can he 
simulated hy an 0{t(ji)) time-hounded nondeterministic two-stack machine. 

Proof. We will divide the simulation into two stages. The first stage simulates in 
realtime a deque by a queue and two stacks Sl and Sr with some restrictions on 
the way the queue can be accessed. In the second stage we show how to simulate 
a queue and two stacks with two stacks within the time-bound claimed in the 
theorem. 

Without loss of generality we assume that the deque is empty when the 
computation terminates. We claim that a single deque can be simulated by a 
queue and two stacks step by step as follows. The simulator starts with empty 
stacks and an empty queue. It reads input and executes operations in accordance 
to the finite control of the deque machine. In addition the simulator is in one of 
two modes, either in left-right or in right-left mode (these names will be explained 
later) . The initial mode can be chosen arbitrarily and during the simulation the 
simulator may switch from one mode to the other whenever the queue is empty. 
The possible deque operations are now simulated in the following way: 

Insert at the left end: Nondeterministically choose to push the new symbol 
onto Sr or, provided Sr \s empty and the current mode is left-right, insert 
the new symbol into the queue. 

Insert at the right end: Nondeterministically choose either to push the new 
symbol onto Sr or, provided Sr is empty and the current mode is right-left, 
insert the new symbol into the queue. 

Delete at the left end: If Sr is not empty, pop one symbol. If Sr is empty, 
remove one symbol from the queue, provided it is not empty and the current 
mode is right-left. Otherwise abort the simulation. 

Delete at the right end: If Sr is not empty, pop one symbol. If Sr is empty, 
remove one symbol from the queue, provided it is not empty and the current 
mode is left-right. Otherwise abort the simulation. 

Sense emptiness: Check that Sr, Sr, and the queue are empty. 

The main invariant of the simulation (which holds after the simulation of one 
deque access) is, that the current deque contents of the simulated machine are 
always encoded by the concatenation of the contents of Sr with its top to the 
left, the queue either with its rear to the left (in left-right mode) or to the right 
(in right-left mode), and Sr with its top to the right. By checking the simulations 
of the four operations on the deque and their subcases it can be verified that 
this invariant is maintained and that the access is always to the symbols at the 
encoding of the ends of the deque. We thus see that each successful run of the 
simulator corresponds to an accepting computation of the deque machine. 

Conversely, we have to argue that every accepting computation of the deque 
machine can be simulated successfully. Fix an accepting computation of the 
deque machine. We can associate with every symbol stored on the deque (which 
we assume to be distinguishable for our considerations) two bits of information: 
whether it entered the deque from the left or the right end and whether it 
will leave the deque at the left or the right end (we assumed that the deque 
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will eventually be empty). The symbols will be sorted according to each of the 
components, which leads to the important observation that in each configuration 
of the deque there is at most one type of symbols with a ‘mixed mode’ (left-right 
or right-left). Let us assign to each configuration of the deque the ‘mixed mode’ 
of its contents (or an arbitrary mode if there is no symbol with mixed mode). The 
simulator now follows the information associated with the symbols and always 
works in the mode prescribed by the mode of the configuration of the deque. It 
does so by storing the left-left symbols on Sl, the right-right symbols on Sr, and 
all mixed-mode symbols (left-right or right-left) on the queue. Clearly the mode 
of the deque can only change when there is no mixed-mode symbol, and then 
the queue of the simulator is empty allowing it to switch between the modes 
as well. If the deque contains symbols that should be deleted at the left end, 
the simulator will either store them on Sr, where they are readily available, or 
they are mixed-mode symbols, which forces them to be at the front of the queue 
where they can be accessed by the simulator. An analogous argument shows that 
deletions at the right end can be carried out faithfully. 

By the above discussion each computation of the deque machine can be 
simulated and each run of the simulator corresponds to a computation of the 
deque machine. 

Now we show how to simulate the two stacks and one queue on two stacks. 
The simulated stacks will be mapped onto the corresponding stacks of the sim- 
ulator, with the additional property that there is a bottom marker below each 
portion encoding a simulated stack. Below the bottom markers two strings are 
stored, where the shorter string is a suffix of the longer if the simulation is to 
be completed successfully (if both strings have the same length, they should be 
identical). If the simulated machine is in left-right mode, then the string below 
the marker stored on Sr is at least as long as the one on Sr and vice versa for 
right-left mode. If a symbol is to be entered into the queue, then this symbol 
is added to the ‘longer’ string, if a symbol is deleted it is guessed and added 
to the shorter string. Note that accessing the strings below the marker is al- 
ways possible since the corresponding stack will be empty at that point of the 
simulation. 

If eventually an accepting state has been reached, the contents of the two 
stacks are compared symbol by symbol in 0{t{n)) time and the input is accepted 
if they are identical. 

The simulation strategy described so far admits a simulation of each valid 
computation. It may however include invalid computations as well for two rea- 
sons. First, switching from one mode to the other is allowed only if the queue is 
empty, and the simulator has to verify this condition. Second, even if switching 
from one mode to the other occurs only at valid points during the simulation, 
symbols might be deleted from the simulated queue before they have been in- 
serted. The first problem is overcome by inserting a special zero marker into 
the guessed strings after the last symbol has been entered resp. deleted just be- 
fore switching the mode. Consider a situation when the simulator is in left-right 
mode. Then after the last symbol during this phase has been stored on Sr the 
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simulator adds the zero marker to the string on Sr and records that the marker 
has been set. Note that above this zero marker the stack can still be simulated, 
but further insertions are not allowed until another zero marker is stored on 
and the mode changes to right-left. The simulator works analogously in right-left 
mode. Zero markers necessarily occur in pairs, one on each stack of the simu- 
lator. Therefore the length of the strings at steps when the mode changes are 
equal (assuming that the final comparison is successful) and the simulated queue 
is empty at these points. 

For the second problem we associate with each symbol a flag from the set 
{0, 1} and we let the simulator remember the current flag /. With each symbol 
inserted into the queue, the simulator associates / -I- 1 (mod 2), with every 
deleted symbol the flag /. If the current value of the flag has been used for at least 
one insertion it may nondeterministically be incremented by 1 modulo 2. Assume 
for a contradiction that the resulting strings could be equal although at least one 
symbol has been deleted from the queue being simulated before it was inserted. 
Consider the situation just before this happens for the first time. The lengths of 
the strings will be identical and the flags of the topmost pair of symbols match, 
both having the value / (since we assumed that the strings would eventually be 
equal). The flag has been incremented since the last insertion without having 
been used for another one. No further modification of the flag is possible before 
the next symbol is inserted into the simulated queue. Therefore the (erroneously) 
deleted symbol gets flag / as well. The next symbol to be inserted will have flag 
f + I (mod 2), which will cause a mismatch, contradicting our assumption. 

Since each step in the two stages of the simulation can be carried out in 
constant time, we see that the entire simulation can be completed in 0{t{n)) 
steps. □ 

Together with the obvious simulation of two stacks by a deque (the stacks 
are stored with bottom markers next to each others at both ends of the deque) 
we obtain: 

Corollary 1. For nondeterministic machines a single deque and two stacks are 
equivalent up to a linear time overhead. 

We can now answer the question from [3] whether nondeterministic machines 
with several tapes working in linear time are stronger than linear-time machines 
with one deque. Informally we obtain that two tapes (which are equivalent to 
many tapes [2]) are stronger than a single deque. 

Corollary 2. There is no general linear time simulation of multiple tapes by 
one deque for nondeterministic machines. 

Proof. Suppose that a linear time simulation of two tapes by a deque exists. For 
the special case of realtime acceptance by two tapes we obtain an upper bound 
of 0{n) on two stacks. Simulating these two stacks by the method of [12] on 
a single tape we obtain an 0(n^ ®-\/log n) upper bound on the time. But this 
contradicts the known almost quadratic lower bounds for the nondeterministic 
simulation of two tapes by one tape, e.g. flir? j log^^^ n) for any positive k from 
[6] (log^^^ denotes the fc-fold iteration of log). □ 
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By calculating the simulation overhead we can even derive the lower bound 
l7(n^/^/i/log n) for the nondeterministic simulation of two tapes by one deque. 

For nondeterministic computations three stacks are as powerful as any finite 
number of tapes [2]. Simulations by three or more stacks are therefore interest- 
ing only in the deterministic setting. In the next section we will consider some 
simulations of this kind. 

4 Deterministic Simulations 

For deterministic simulations of a deque by two stacks we have a less complete 
picture than in the nondeterministic setting. The well-known linear time simu- 
lation of a queue by two stacks (see, e.g., [8,13] or [10, Ex. 2.2.1.14]) generalizes 
to output-restricted deques, while we don’t have an analogous result for input- 
restricted or general deques. 

The question whether there is a realtime equivalence between two stacks and 
an output-restricted deque is answered negatively by the following results. 

Let denote the reversal of string u. Define L C {0, 1, 2, 3, $}* as 

L = {wi$W2$W3xy$z I wi,W2, W3 G {0, 1}*, X G {2, 3}, 

y = W 2 if X = 2, y = if a; = 3, G {0, 1}* : wi = tz^}. 



Lemma 1. There is a deterministic two-stack machine that accepts L in real- 
time. 

Proof. Clearly, the machine can decide membership in the regular language 
{0, 1}*${0, 1}*${0, 1}*{2, 3}{0, 1}*${0, 1}* on the fly. We may therefore assume 
that the input can be decomposed according to the definition of L. The machine 
accepting L pushes copies of rci on both of its stacks and marks these segments 
of its stacks. Then it copies W 2 onto stack 1 and ws onto stack 2. When x is 
encountered it accordingly compares the string stored on stack 1 or stack 2 with 
the input segment y following x. If y matches, the remaining symbols of the 
input after the last $ are compared with the topmost section of the stack which 
has just been emptied (discarding the marker first). □ 

Lemma 2. Deterministic machines with one output-restricted deque cannot ac- 
cept L in realtime. 

Proof. We assume that L can be accepted by a machine M of the kind described 
in the lemma (with a deque that allows deletions only at its rear) and derive 
a contradiction. We simplify the proof by assuming that M executes one step 
per input symbol, which can be achieved by possibly increasing the number of 
internal states and of storage symbols. 

The proof uses the notion of incompressibility, in particular the non-con- 
structive existence of incompressible binary strings of each length, see [14]. We 
will also require some strings of symbols to be self delimiting, which means that 
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their last symbols can be detected even if they are concatenated with other 
strings. A simple (though not very efficient) way to achieve this is to repeat each 
bit except for the last one, which is encoded by one of 01 or 10. 

Let m be the number of different symbols available to M for storing infor- 
mation on its deque. Fix the constant c = max(2, 3[log2 m]). Choose an incom- 
pressible string w of length (c^ -I- 2c -I- l)n for n sufficiently large to make the 
argument work. Divide w into three portions w = W 1 W 2 W 3 with |wi| = (c^-l-c)n, 
|w2| = n, and |w3| = cn. We now consider accepting computations of M on input 
strings with a prefix wi%. Our first claim is that after having read this prefix the 
length £ of M’s deque exceeds 2(c-|- l)n. We can recover wi from the contents of 
the deque by feeding the string $2$ into M (this will encode three empty strings 
W 2 , W 3 , and y) and then looking symbol by symbol for the longest trailing string 
that causes M to accept. Since any string that leads to acceptance is a prefix of 
Wi, this will recover Wi- Now suppose £ < 2(c -I- l)n. The entire string w can 
be described by: 

— A self delimiting description of M of size 0(1). 

~ The state of M after reading wi$ of size 0(1). 

— A self delimiting description of the above algorithm for recovering wi of size 

0 ( 1 ). 

— A self delimiting encoding of £ in O(logn) bits. 

— A self delimiting encoding of n in 0(log n) bits. 

— The contents of M’s deque encoded in binary in at most £ |"log2 to] < | (c^ -I- 
c)n bits. 

~ The strings W 2 and W 3 (no encoding is necessary, since their lengths are 
known) . 

For sufficiently large n this will result in a compression of w contradicting its 
choice. 

Notice that after reading w\% the deque is now longer than the remaining 
suffix of any accepted input of the form we consider with an empty trailing 
portion z. This implies that the first symbol on M’s deque at this point will never 
be removed and nothing added at the head can later influence the computation. 

Consider next the state of M’s deque when wi$W2Sw3 has been read. We 
count the number of trailing positions of these deque contents that have been 
modified while M was reading W 3 . Assume first that this number exceeds n -I- 1. 
Intuitively, M has — up to a finite number of bits — hidden (or even destroyed) 
the encoding of W 2 - If the trailing n -I- 2 symbols on the deque are known, then 
W 2 can be recovered by starting M in its state qi after reading wi$W 2 $W 3 and 
feeding in strings of the form 2u$ with m G { 0, 1}" until the unique string 2 w^$ 
leading to acceptance has been found. The trailing segment of length n -I- 2 of 
the deque can be generated by starting M from the state <72 it was in before 
it accessed the first symbol in this segment for the last time and feeding in the 
portion of W 3 that had not been processed at this point. 

We can now describe w by: 

— A self delimiting description of M of size 0(1). 
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— The states q\ and <72 in 0(1) bits. 

— A self delimiting description of the above algorithm for recovering W 2 of size 

0 ( 1 ). 

— A self delimiting encoding of n in 0(log n) bits. 

~ A self delimiting encoding of the position within W 3 from which M has to 
be started in order to generate the trailing portion of the deque in O(logn) 
bits. 

— The strings wi and IV 3 (again, their length is available). 

For n sufficiently large we obtain a compression contradicting the choice of w. 

The other case is where a trailing string u of at most n + 1 symbols of the 
deque includes all positions modified while reading W 3 . It is dealt with in a 
similar way, now compressing W 3 . □ 

As a consequence of the two preceding lemmas we obtain: 

Theorem 2. Deterministic machines with one output-restricted deque cannot 
simulate two-stack machines (and hence deque machines) in realtime. 

Two stacks cannot recognize the language {x2x' | x G {0, 1}* and x' is a prefix 
of x) by the (stronger) result of [9] in realtime, while even with a queue this 
language can be accepted. Therefore two stacks and an output-restricted queue 
are of incomparable power for deterministic realtime computations. 

Next we consider simulations by more than two stacks. Rosenberg [16] de- 
scribes a linear time simulation of a general deque by three stacks. The idea 
behind the simulation is again to maintain a left and a right section of the deque 
on two stacks. Whenever exactly one of these sections is empty and a delete 
operation on the respective portion is requested, the third stack is used in order 
to partition the non-empty section into segments of approximately half of its 
original size. The analysis of this simulation is somewhat difficult, since items 
may be copied from one section to the other and back an unbounded number of 
times. 

We present here a different linear time simulation of a deque by three stacks, 
which is easier to analyze and has a smaller linear factor in terms of the number 
of push and pop operations required. While Rosenberg’s solution uses at most 
18n stack-operations for simulating n deque-operations, our bound will be 9n. 
We remark that Rosenberg’s simulation is closely related to the construction 
used by Stoss [17] in the simulation of multi-head tape-units by single-head 
ones, while ours is inspired by the very first proof of such a simulation given by 
Hartmanis and Stearns [7]. 

Theorem 3. A sequence of n deque operations can he simulated deterministi- 
cally by three stacks with 9n stack- operations. 

Proof. Like in Rosenberg’s simulation, we represent a deque as two stacks Sl and 
Sr that encode the current contents of the deque. In contrast to the simulation 
outlined above, where (except for intermediate states) the concatenation of the 
two stacks reflects the deque contents verbatim, in our simulation some infor- 
mation may be duplicated or outdated. Therefore some deletions at the bottom 
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of the stacks might be necessary in order to obtain the current contents of the 
simulated deque. The membership of an item in the section possibly outdated 
or duplicated will be expressed by coloring it red, while all other items will be 
green. These colors can either be represented by a single bit attached to each 
item, or by markers on the stacks, since items of the same color form blocks. 
Initially Sl and Sr are empty, as is the third stack St that will hold a counter 
which has the value 0 at the start of the simulation. Insert operations are simply 
mapped to push operations of green items onto Sr or Sr. We will now describe 
a delete operation on the left in detail. The corresponding operation on the right 
works symmetrically. 

If Sr contains an item that is colored green as its topmost item, then it is 
returned. If S'/, has a red item as its topmost and the count maintained on the 
third stack is not 0, then the red item is returned and the counter is decremented. 
Otherwise a clean-up phase is initiated. All remaining items on Sr are deleted. 
The green items on top of Sr are popped one after the other and pushed onto 
Sr as well as onto Sr, changing their color to red. The remaining red items 
on Sr are discarded. The items now on Sr are moved back onto Sr and their 
number is counted on Sr above its red items. Finally the count is transferred to 
Sr. Then the simulation is resumed and the deque operation is carried out, if 
possible. 

In order to see that the simulation is correct, note that the current contents of 
the deque being simulated is encoded by the green items on Sr (top to bottom), 
a top segment of c red items on S'j;, (where c is the value currently stored on Sr), 
or equivalently the mirror image of a top segment of red items on Sr, and finally 
the green items on Sr from bottom to top. This invariant is clearly maintained 
by push operations and pop operations of green items. A pop operation of a 
red item reduces the count and thus adapts the length of the red segments 
considered on both stacks. The clean-up moves a sequence of green items into 
fully overlapping red segments. 

Each item enters in green color, causing one push (1 operation). If it leaves 
the deque while still having green color, the total cost is 2 operations. Otherwise 
it changes its color to red. While doing so, it is moved from Sr or Sr to the 
other stacks (3 operations), one of the copies is moved back from Sr and counted 
(3 operations), and the count is transferred to Sr (2 operations). If it is never 
removed, then both copies stay in the stacks and the total cost is 9 operations. 
Otherwise the item is removed and possibly also its other copy (2 operations). 
In addition the count is updated (I operation). In the latter case the total is at 
most 12 push or pop operations for simulating two operations, which yields 6 
operations per simulated deque operation. □ 

In the simulation given in the previous proof red items can be left on the 
stacks in the clean-up phase (which then doesn’t really clean-up), saving n op- 
erations at the cost of wasting stack space. Also, the counter values could be 
reduced by a linear factor, which would save (3 — e)n operations for any e > 0, 
an improvement which can be applied to Rosenberg’s simulation as well. 
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Abstract. We refine and optimize a computationally intensive enumer- 
ation method, based on the traversal of a quadtree, for finding lower 
bounds on the lengths of reflecting sequences for labeled chains. The im- 
provement results from the introduction of a redundancy relation defined 
on vertex-pairs of the underlying quadtree, which enables the pruning of 
redundant branches near the root of the quadtree, as well as locally at 
deeper depths. The test run of the implementation showed a length lower 
bound of 19t — 214 for t-reflecting sequences for labeled 7-chains with sig- 
nificant speedup, which yields the current length lower bound 
for universal traversal sequences for 2-regular graphs of n vertices, and 
for universal traversal sequences for d-regular graphs of 
n vertices, where 3 < d < ^ -I- 1. 



1 Preliminaries 

Reflecting sequences are variants of universal traversal sequences, and were intro- 
duced by Tompa [9] in proving lower bounds on the lengths of universal traversal 
sequences. The study of universal traversal sequences is motivated by the com- 
plexity of graph traversal. Good bounds on the lengths of universal traversal 
sequences translate into good bounds on the time complexity of certain undi- 
rected graph traversal algorithms running in very limited space. 

Aleliunas et al. [1] proved the existence of polynomial-length universal traver- 
sal sequences, yielding polynomial-time and logarithmic-space (non-uniform) de- 
terministic algorithms for undirected graph traversal. Beame et al. [2] used vari- 
ants of the jumping automaton for graphs of Cook and Rackoff [6] to study 
time-space tradeoffs for undirected graph traversal. Good lower bounds on the 
length of universal traversal sequences provide a prerequisite to proving time- 
space tradeoffs for traversing undirected graphs for these computational models. 



I. 1 Universal Traversal Sequences for Undirected Graphs 

For a graph G, denote by V{G) the vertex set of G, and by E{G) the edge set 
of G. For positive integers d and n such that d < n, let Q{d,n) be the set of all 
connected d-regular graphs of order n with an edge-labeling. The edge-labeling 
on G G t/(d, n) is defined as follows: For every edge {u, u} G E{G), there are two 
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labels lu,v,lv,u G {0, 1, . . . , d — 1} with the property that for every u G V{G), 
{lu,v I {u,v} G E{G)} = {0, 1, . . . , d — 1}. Note that Q{d,n) is not empty if 
and only if dn is even. A sequence U G {0, 1, . . . , d — 1}* can be considered 
as a sequence of edge-traversal commands and induces a walk starting at any 
vertex in every labeled graph in Q{d, n). We say that such a sequence U traverses 
G G Q{d, n) starting at uq G F (G) if, by starting at vq and following the sequence 
of edge labels in [/, all the vertices in G are eventually visited. The sequence 
U is called a universal traversal sequence (UTS) for Q{d,n) if U traverses every 
G G G{d, n) starting at every vertex in G. 

Let U{d,n) denote the length of a shortest UTS for non-empty G{d,n), and 
define U (d, n) = U(d, n-l-1) in case Q{d, n) is empty. The lower and upper bounds 
on U{d,n) for various ranges of d were studied in [1], [3], [4], [7], [9], [5], and [8]. 
Currently, the best lower bounds on U (d, n) are in [8] , 

TTt^ Jf?(n'°S7i9) ifd=2, 

G (.d, n) I f2{d^-iog, i9j^H-iog, 19) Jf 3 < ^ ^ ^ _ 

which were obtained using a computationally intensive approach. 

1.2 Reflecting Sequences for Labeled Chains 

For positive integer n, a labeled chain of length n is a graph G with vertex set 
V{G) = {0,1,..., n} and edge set E{G) = {{i,i + 1} | 0 < i < n} with an 
edge-labeling defined as follows: Every edge (i, i -I- 1} G E{G) is associated with 
two labels h^i+i and each a non-empty subset of (0, 1}, such that 

f- ^0,1 — ln,n—i — {0, 1{, and 

2. li^i-i and k^i+i form a partition of {0, 1} for all 0 < t < n. 

Denote by C{n) the set of all labeled chains of length n. A labeled chain 
G G C{n) can be identified with the sequence a = G {0,1}"“^, 

where = {cti} for 0 < i < n. 

Given a labeled chain G G £{n), every sequence U = U\U 2 - ■ - Uk G {0,1}^ 
where fc > 0, when considered as an edge-traversal sequence starting at vertex 
0 in G, determines a unique sequence (0 = Uq, ui, U2, . . . , Vk) G (0,1,..., 
such that Ui G lvi_^,vi for all 1 < t < fc. For positive integer t, a sequence 
U G {0, 1}* is said to reflect t times in G G £{n) if the walk in G induced by 
U alternately visits the end-vertices n and 0 at least t times; that is, there exist 
0 < < Z 2 < • • • < it < |C/| such that Ui 2 j_i = n for all 1 < j < [ and = 0 

for all 1 < j < [|J. The sequence U is called a t-reflecting sequence for £(n) if 
U refiects t times in every G G £{n). Denote by R{t,n) the length of a shortest 
t-refiecting sequence for £{n). 

The following theorem reduces the length lower bounds for universal traversal 
sequences to those for t-refiecting sequences. 

Theorem 1. [8] Suppose that there exist positive integer c > 2 and positive reals 
r >2 and k such that, for every positive integer t, R{t, c) > rt — k. Then 

r G(n^°s<=’-) ifd=2, 

{ G(d2-logc7„l+log,r) if3<d<jt^ + l. 



U {d, n) 
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In the light of Theorem 1, we can obtain length lower bounds for universal 
traversal sequences by seeking constants c, r, and k such that every t-reflecting 
sequence for C{c) has length at least rt—k, in which a large log^ r translates into 
a good length lower bound. The lemma below shows that it suffices to consider 
only the case when t is odd. 

Lemma 1. [8] For all positive integers t and c and positive reals r and k, if 
R{2t—l,c) > 2rt — k for every positive integer t, then Rft' , c) > rt' — k for every 
positive integer t' . 

2 The Quadtree Enumeration Method 

To show that R{2t — 1, c) > 2rt — k in Lemma 1, we assign a minimum of 
“marks” to various positions in a hypothetical {2t — l)-reflecting sequence S {t 
marks for each labeled chain of length c). We argue that the “marks-to-bits” 
density for all but a short suffix of S must be at most , implying a minimum 
length of 2°“^t/(2°“^/2r) = 2rt for all but a short suffix of S having a bounded 
number of marks. We then determine k, depending on the maximum number of 
marks in the suffix, which gives that R{2t — 1, c) > 2rt — k. 

Following the framework developed and analyzed in [8], it suffices to con- 
struct a tree T that can be used to divide every marked S into even-length seg- 
ments, such that all but possibly the last segment has a marks-to-pairs density 
at most . Specifically, T is a quadtree whose vertices represent potential even 
bit-positions in a hypothetical (2t — l)-reflecting sequence for £(c), and whose 
branches from a position are labeled by the four possible pair-continuations at 
that position, that is, 00, 01, 10 and 11. 

We summarize the Quadtree Enumeration method [8] in the following algo- 
rithm. 

algorithm Build.Quadtree (c, p) 

1. Initialize S to the pair 00; 

2. (* Only examine 00-branch by symmetry. *) 

While S ^ 01 do 

2.1. Try to find a prefix (pair-sequence) of S whose marks-to-pairs 
density is at most p; 

2.2. If no such prefix exists, then 

2.2.1. Extend the quadtree by appending the pair 00 to S', 

else 

2.2.2. (* The sequence S represents a leaf. *) 

Replace S with the pair-sequence no longer than S and 
following S in the lexicographic ordering of pair-sequences; 

end Build-Quadtree 

To determine in step 2.1 an upper bound on the marks-to-pairs density of a 
sequence S, we assume the maximum number of marks possible for each chain 
in C{c), which can be obtained by assuming the traversal of the chain starting 
at vertex 1 with the traversal sequence S. Hence, for each Ci G F(c), where 
i G {0'^“^, . . . , we traverse Ci starting at vertex 1 with S, and place 

two types of marks Mi on the pairs of S' as follows. 

1. Closed marks of Mi on S that correspond to the last exit from vertex 1 
during every traversal from vertex 0 to vertex c with S, and 
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2. An open mark of Mi on S that corresponds to the last exit from vertex 1 
during a traversal from vertex 0 without visiting vertex c with S. Equiva- 
lently, the mark Mi on S is not closed on S but it becomes closed on SS' 
for some sequence S' £ {0, 1}*. 

We compute the desired upper bound in a graph-theoretic setting. Informally, 
for a marked traversal sequence S above, we construct an inconsistency graph 
Gc(S) of S, whose vertex set represents all possible open marks on S and edge 
set represents all pairwise inconsistencies between open marks on S. 

The algorithm for deciding if a particular S constitutes a leaf in the quadtree 
is summarized below. 

algorithm Is_Leaf {S, c, p) return boolean 

1. For each i G place closed and open marks of Mi on S; 

2. For each non-empty prefix (pair-sequence) S' of S, 

2.1. Form the inconsistency graph Gc{S') of open marks on S, and solve 
the Maximum Independent Set problem for Gc(S'); 

2.2. Compute the sum m of the number of closed marks on S' (in step 1) 
and the independence number of Gc{S') (in step 2.1); 

2.3. If m/ < p, then return true; 

3. Return false; 
end Is_Leaf 

A successful termination of Build_Quadtree (c, gives the supposition in 
Lemma 1. 

Theorem 2. [8] If Build-Quadtree{c, - — ) successfully terminates for some 
positive integer c > 2 and positive real r > 2, then the supposition (hence the 
conclusion) is achieved. 

3 Refinements and Optimizations 

In step 2 of the algorithm Is_Leaf, each upper-bound computation for the marks- 
to-pairs density of every non-empty prefix S' of segments S' of a hypothetical 
reflecting sequence induces an instance of the Maximum Independent Set prob- 
lem for Gc(S'). We can encompass all these instances with a unified approach: 
each Gc(S') is an induced subgraph of the same inconsistency graph Gc, which 
captures all (pairwise) inconsistencies between all possible pairs of open marks 
on a traversal sequence. 

For a positive integer c, the inconsistency graph Gc is defined as follows. The 
vertex set of Gc is {(Gq, m) | a G {0°“^, . . . , 1°“^} and u G {2, 3, . . . , c — 

1} is odd}, in which a vertex (Gq,u) represents an open mark Mq, on a non- 
empty traversal pair-sequence with the terminating vertex u (hence 1 < u < c 
and u is odd). The edge set of Gc represents pairwise inconsistencies between 
all possible pairs of ((Gq,m), (G/s,v)) G V{Gc) x V{Gc) from one trivial and two 
non-trivial sources. 

The trivial source of inconsistency results from that each traversal pair- 
sequence yields at most one open mark M^ for each a G 0"'“^!, . . . , 

so we have ((Ga, u), (Gq, u)) G E{Gc) for all (Gq, w), (Gq,, v) gV (Gc) with u^v. 
The two non-trivial sources are suffix-inconsistency and prefix-inconsistency. In- 
formally, the suffix-inconsistency reffects the co-existence of two open marks. 
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while the prefix-inconsistency represents the co-reachability of the open marks 
from their initial configuration. 

For a € {0, 1}*, i.e., a = aia 2 • • • On for some n > 0, where at G {0, 1} for 
t = 0, 1, . . . , n, let a(i, j) denote aiOj+i ■ ■ ■ aj if z < j, and the empty sequence 
otherwise. We denote by a the reversal of the component-wise complement of 
a, that is, a = oF a„_i • • • cFT- To study the suffix-consistency of {Ca,u) and 
(C/j, v) for two labeled chains Ca, Cj^ G £(c) and two vertices u,v G {1, 2, . . . , c} 
in Ca and C/j, respectively, we consider S(u, v) = (c—u) — (w— 1), which measures 
the difference of distances of u and v from vertices c and 1 in Ca and Cfj, 
respectively. 

The theorem below characterizes the suffix-inconsistency between (Ca,u), 
{C/ 3 , v) G £(c) X {2, 3, . . . , c — 1} via S(u, v) in the number of non-refiecting bits 
in a(2, c—l — S{u, v)) versus /3(2, c—l — S{u, v)). We define a first-order predicate 
NRc,a ,/3 for the maximum number of non-refiecting bits in such a,(3 G {0, 1}°“^ 
inductively as follows: 



NRc,a,fd{i) 



(3{2, c — 1 — i) = a{2, c — 1 — i) if z is either 0 or 1 , 

(/3(2, c — 1 — z) = a(2, c — 1 — z))V 

3ji G {2, 3, . . . , c 1 “ o; 2 +c—i—i—ji 

ANRc^a,0{i ~ 2)) if z > 2 . 



Theorem 3. Let {Ca, u), {Cp, v) € £{c) x {2, 3, . . . , c — 1}, we have: 

1. If 6{u,v) < 0, then {Ca,u) and {Cf 3 ,v) are suffix- consistent. 

2. If 5{u,v) > 0, then {Ca,u) and {Cf 3 ,v) are suffix-inconsistent if and only if 
NRc,a,(}{5{u,v)). 



3.1 Pruning Redundant Branches in the Quadtree 

Every labeled chain C G £{c) has an “ideal walk” starting at vertex 1 that 
represents the densest packing of reflections (marks) that C can exhibit; and that 
sequence of vertices visited depends only on c: 1, 2, ... , c, c— 1, c— 2, ...,1,0,1,.... 
We can measure the goodness of every walk in a labeled chain C starting at vertex 
1 by how much of the ideal walk in C starting at vertex 1 is covered. 

We define the notion of weight formally as follows. Let W{C,S) denote the 
walk induced by a traversal sequence S G {0, 1}* in a labeled chain C G £(c) 
starting at vertex 1. A low reflection at vertex 0 represents a traversal from vertex 
c to vertex 0 with S; and a high reflection at vertex c is defined analogously. 
For a walk W(C, S) terminating at vertex v, the ideal walk of W(C, S), denoted 
by Ideal{W{C,S)), is the ideal walk in C starting and terminating at vertices 
1 and V, respectively, which has the same number of reflections as W{C,S). 
The weight, wt{C, S), of a walk W{C, S) is the length of the vertex-sequence of 
Ideal{W{C, S)). 

The following two theorems justify the pruning of redundant branches in the 
quadtree. 
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Theorem 4. For a labeled chain C G £(c) and two traversal sequences Si and 
S2 of the same length-parity, ifwt{C, S'!) < wt{C, S2), then for every S G {0, 1}*, 
wt{C,SiS) < wt{C,S2S). 

Proof. Omitted. | 



For aia2 • • • On G {0, 1}”, where 

TT ^ 0, (1) 7Ti(q;i0203G!4 * * * CXn) — Oi020;3Q;4 * * * 

where a denotes the complement of a G {0, 1}, and a'„ is if n is odd, and 
oil otherwise, (2) 7r2(aia2<a3a4 • • • a„) = c^a 2 <^a 4 ■ ■ ■ a", where a" is olf if n 
is odd, and a„ otherwise, and (3) tts = 7T2 o tti. 

A more general form of redundancy is obtained via the functions tti, 7T2, and 
7T3. Let l{o,i}* denote the identity function on {0,1}*, and II = |l{o,i}* j7’‘i77r2,7T3}. 

Theorem 5. Let Ci he a labeled chain in C{c), where i G . . . ,1'^“^}, 

and 7T G 77. For two traversal sequences Si and S2 of even length-parity, if 
wt{Ci,Si) < wt{CT^(^i), S2) , then for every S G {0,1}*, 
wt{a,SiS)<wt{C^(^ipS 2 Tr{S)). 

Proof. We modify the induction argument in the proof of Theorem 4 as follows. 
For the induction step, suppose that the statement in the theorem is true for 
every S G {0, 1}* with [S'! < n, where n > 0. Consider a sequence S' G {0, l}"'*'^, 
and assume that wt{Ci, Si) < wt{C,^(^ip S2). Consider the two cases for the parity 
of n. 

The case of even n: We write S' = Sa, where S G {0, 1}” and a G {0, 1}. Ap- 
plying the induction hypothesis on S, we have wt{Ci, SiS) < wt{CT^(^ip S2 Tt{S)). 
Note that tt{S') = Tr(S'a) = ■k{S)'k{o). 

If wt{Ci, SiS) = wt{CT,.(^ip S2 '!t{S)), then the terminating vertices of the two 
walks W{Ci,SiS) and «5'2T!'(5')) have the same vertex-label — with 

odd parity due to the even length-parity of SiS and S2Tt{S). Starting at these 
two vertices with the same odd vertex-label in Ci and the sequence of 

vertices visited in Ci induced by a traversal sequence T G {0, 1}* is identical 
to the sequence of vertices in induced by the sequence 7r(T). Thus the 

traversal sequences SiSa and S'27r(S')7r(a) in Ci and (7,^(1)) respectively, satisfy 
that wt{Ci, SiSa) = ■u;t(C'7r(q, S'27r(S')7r(a)). 

If wt{Ci, SiS) < wt(C'7r(i), S'27r(S')), then we have 
wt{C„ SiS) + 2 < wt{C^(,),S2TT{S)) 

as S'! and S2 are of the same length-parity. Notice that |wt(C'i,S'iS'a)— rct(C'i,S'iS')| 
< 1 and \wt{CT^i^ip S2T^{S)Tr{a)) — wt{CT^[ip S2T^{S))\ < 1, we have 

wt{Ci, SiSa) < wt{Ci, SiS)-\-l < wt{Cj^(^ip S2 Tt{S)) — 1 < >5'27!‘(S')7r(a)) . 

The case of odd n: We write S' = Saia2, where S G {0,1}”“^ and 04,02 
G {0,1}. Applying the induction hypothesis on S, we have wt{Ci, SiS) < 
w^^(C'7r(i), -S'27 t(S')). Notice that tt{S') = 7r(S'oi02) = 7r(S')7r(oi02). We proceed 
as in the previous case, and the sub-case when assuming that wt{Ci, SiS) = 
wt(C,r(i)) >5'27’‘(S')) is similar. 
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We now assume that wt{Ci, SiS) < S'27t(5')). As the two sequences 

SiS and 5'27 t(S') are of even length-parity, we have wt{Ci, SiS) + 2 < 
wt(C,r(i), 5'27 t(S')), and both terminating vertices of the walks W{Ci,SiS) and 
kh(C 7 r(i), S'27r(S')) (W{Ci,SiSai) and W(C^(j), S'27r(S')7r(ai))) have odd (even, 
respectively) vertex-labels. Since \wt{C,Ta) — wt{C,T)\ < 1 for all labeled 
chains C G >C(c), T G {0,1}*, and a G {0,1}, we obtain that wt{Ci, SiSai) < 
S'27r(S')7r(ai)) and its strict inequality will give the desired result. 
Consider now that wt{Ci,SiSai) = rct(C 7 r(i), S'27r(S')7r(ai)). Both walks 
W(Ci, S'lS'ai) and W(C^(j), S'27r(S')7r(ai)) terminate at vertices with the same 
even vertex-label v in Ci and CTr{i), respectively. If v is either 0 or c, we obtain 
the desired result. Let and 7r(i)^ denote the bits of the labels i and 7r(z) 
of Ci and C^(q, respectively — the “forward” edge-labels associated with vertex 
V in Ci and respectively. Let denote the last bit of 7r(aia2). We can 

verify that z„ = 02 if and only if 7r(z)„ = a^. Thus both walks W(C'i, S'iS'aia 2 ) 
and S'27r(S')7r(ai)a2) (= S'27r(S')7r(aia2))) terminate at ver- 

tices with the same vertex-label in Ci and C^(q, respectively. Hence 
wt{Ci,SiSaia2) = zct(C'^(i), S'27r(5')7r(aia2)). 

Combining the two cases for the parity of n, we have wt{Ci,SiS') 
< zct(C 7 r(i), S'27r(S")), as desired. This completes the induction step. By induc- 
tion, the theorem is proved. I 

For two traversal sequences S'! and S2 of even length-parity and |S'i| > |S' 2 |, if 
for some tt G iT, wt{Ci, S'!) < ict(C'^(q, <S' 2 ) for each i G {0°“^, . . . , 1““^}, 

then by Theorem 6, for every S G {0, 1}*, wt{Ci, SiS) < zct(C,r(i), 5'27 t(S')) for 
each z G {0°“^, . . . , 1““^}. This gives that pi(SiS) < /9i(5'27r(S')) for every 

S G {0, 1}*. Thus S'! is a redundant branch to explore. 

We define a binary redundancy- ordering relation <r on the set of even-length 
sequences of {0, 1}* as follows. For 81,82 G {0, 1}* of even length, 81 <r 82 if 
and only if |S'i| > |S' 2 |, and for some tt G 7T, wt{Ci, ^i) < S' 2 ) for each 

z G {0““^, . . . , H“^}. Note that <r is a preorder (that is, a reflexive and 

transitive binary relation), and every pair of sequences of length 2 are related 
under <^. We use the <j.-redundancy relation to prune the branches near the 
root of a quadtree, as well as locally at deeper depths. 

3.2 The Shared Tree of Non-redundant Sequences 

For a given chain length c and a pre-determined positive integer A, we generate 
a depth- Z\ tree Ta, consisting of all non-redundant pair-sequences (that is, pair- 
sequences maximal respect to <r) of length at most 2Z\, inductively level by 
level. Suppose that we have constructed the tree Ts of all non-redundant pair- 
sequences of length at most 26, where 1 < <5 < A. Consider the set Ls+i of all 
immediate descendant pair-sequences of the leaves of Ts- Denote by M{A) the 
set of all <r-maximal elements in a set A of sequences of {0, 1}*. We compute 
the set M{V{Ts) U Ls+i) as follows: 

1. Employ a simple divide-and-conquer algorithm to compute the set 
and 
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2. Compute the set M{V {Ts)ULs+i) by examining all pairs in V{Ts)xM{Ls+i) 

for <r-redundancy. 

The non-redundant sequences in the shared tree are saved in a file and 
read each time the algorithm Build_Quadtree is using the chain length c. Before 
a pair-sequence of length at most 2Z\ is examined, it is first looked up in the non- 
redundant sequence list; and if it is redundant, the sequence is not examined. 
At depth 6 that exceeds the pre-determined A, we perform small-scale local 
redundancy analysis to the subtrees rooted at depth S according to the < 1 -- 
redundancy relation. 

3.3 Local Trees of Non-redundant Sequences 

For a given chain length c and a pre-determined positive integer A' , and a pair- 
sequence S of length 2{A+iA'), where i > 0, we can cover S with the shared tree 
T/i and a sequence/stack of local trees ■ ■ ■ ,ti, each of which is a depth-Zl' 

tree of locally non-redundant pair-sequences rooted at a prefix of S. 

We can construct the sequence of local trees inductively on f in a manner 
analogous to the construction of the shared tree Ta described above. Before the 
local tree ti+i is generated, the pair-sequence at its root is compared to the non- 
redundant sequence lists of . . . ,ti; and if it is redundant, it is treated as 

a leaf. (In our current implementation, only the non-redundant sequence list of 
ti is included for comparison.) 

4 Conclusions 

We have implemented the Quadtree Enumeration method with the improve- 
ments outlined in previous section (implementation is available upon request 
from the author), and have run the algorithm using a chain length of c = 7, 
a marks-to-pairs density upper bound of p = f|, a <r-redundancy shared 
tree of depth Z\ = 10, and local trees of depth A' = 4. This has resulted in 
A(t, 7) > 19t — 214 and the length lower bounds for universal traversal se- 
quences as in [8] with significant speedup, excluding the computation time of 
the <j.-redundancy shared tree. 

Our current effort is to employ the Quadtree Enumeration method incor- 
porating the above-mentioned refinements and optimizations for a larger chain 
length of 9. Future work is directed at finding fast algorithms for solving the 
embedded Maximum Independent Set problems for induced subgraphs of incon- 
sistency graphs, and at improving the construction of the <r-redundancy table. 
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Abstract. We consider a competitive facility location problem with two 
players. Players alternate placing points, one at a time, into the play- 
ing arena, until each of them has placed n points. The arena is then 
subdivided according to the nearest-neighbor rule, and the player whose 
points control the larger area wins. We present a winning strategy for 
the second player, where the arena is a circle or a line segment. 



1 Introduction 

The classical facility location problem [5] asks for the optimum location of a 
new facility (police station, super market, transmitter, etc.) with respect to a 
given set of customers. Typically, the function to be optimized is the maximum 
distance from customers to the facility — this results in the minimum enclosing 
disk problem studied by Megiddo [8], Welzl [12] and Aronov et al. [2]. 

Competitive facility location deals with the placement of sites by compet- 
ing market players. Geometric arguments are combined with arguments from 
game theory to see how the behavior of these decision makers affect each other. 
Gompetitive location models have been studied in many different fields, such as 
spatial economics and industrial organization [1,9], mathematics [6] and opera- 
tions research [3,7,11]. Gomprehensive overviews of competitive facility locations 
models are the surveys by Friesz et al. [11], Eiselt and Laporte [3] and Eiselt et 
al- [4]. 

We consider a model where the behavior of the customers is deterministic in 
the sense that a facility can determine the set of customers more attracted to it 
than to any other facility. This set is called the market area of the facility. The 
collection of market areas forms a tessellation of the underlying space. If cus- 
tomers choose the facility on the basis of distance in some metric, the tessellation 
is the Voronoi Diagram of the set of facilities [10]. 

We address a competitive facility location problem that we call the Voronoi 
Game. It is played by two players, Blue and Red, who place a specified number, n, 
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of facilities in a region U . They alternate placing their facilities one at a time, 
with Blue going first. After all 2n facilities have been placed, their decisions 
are evaluated by considering the Voronoi diagram of the 2n points. The player 
whose facilities control the larger area wins. 

More formally, let and be the respective locations of the blue 

and red points and set 

B = |{m G U : imnd{u,bi) < min d(u, ri)}|, 

i i 

TZ=\{u^U : min d(u, r^) < min d(u, &i)}| 

I i 

where d{u,v) is an underlying metric and | • | indicates the area of a set. Blue 
wins if and only ii B > TZ, Red wins if and only if 7^ > and the game ends in 
a tie if B = TZ. 

The most natural Voronoi Game is played in a two-dimensional arena U using 
the Euclidean metric. Unfortunately nobody knows how to win this game, even 
for very restricted regions U. In this note we present strategies for winning one- 
dimensional versions of the game, where the arena is a circle or a line segment, 
and variations. In other words, we consider competitive facility location along 
an Australian highway. 

The next section discusses the simplest game, on the circle. It is obvious that 
the second player. Red, can always achieve a tie by playing on the antipode of 
Blue’s move. One might try to tweak this strategy such that it results in a win 
for Red. This doesn’t seem to work, and we present instead a quite different 
winning strategy for Red. 

Section 3 describes how this strategy remains a winning strategy even if the 
rules of the game are drastically relaxed. 

In Section 4 we finally turn to the line segment arena. It would appear that 
Blue has an advantage here, because it can play the midpoint of the segment in 
its first move. We show that this doesn’t help, and prove that Red still has a 
winning strategy. The strategy is quite similar to the one for the circle case, but 
its analysis (because of a loss of symmetry) is more detailed. 

2 The Basic Circle Game 

There are two players. Blue and Red, each having n points to play, where n > 1. 
They alternate placing these points on circle C, with Blue placing the first point. 
Red the second. Blue the third, etc., until all 2n points are played. We assume 
that points cannot lie upon each other. Let be the locations of the blue 

points and be those of the red ones. After all of the 2n points have been 

played each player receives a score equal to the total circumference of the circle 
that is closer to that player than to the other, i.e.. Blue and Red have respective 
scores 



B = \{x € C : min d{x, bi) < min d{x, ri)}| 

i i 

TZ=\{x & C : min d{x, Vi) < min d{x, bi)}\ 
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The player with the highest score (the larger circumference) wins. 

The question that we address here is, Does either player have a winning 
strategy and, if yes, what is it? We will see below that the second player, Red, 
always has a winning strategy. 

Before giving the strategy we introduce some definitions. We parameterize 
the circle using the interval [0, 1], where the points 0 and 1 are identified. Arcs 
on the circle are written as [x,y\ implying the clockwise arc running from x to 
y, as in [.5, .6] or [.9, .1]. 

Definition 1. The n points Ui = i = 0, 1, . . . , n — 1 are keypoints. 

We call an arc between two clockwise consecutive red/blue points an interval. 
The interior of an interval is free of red/blue points. At any given time during 
the game the circle is partitioned into intervals. An interval is monochromatic if 
its endpoints have the same color, and bichromatic if they have different colors. 
A blue interval is a blue monochromatic one, a red interval a red monochromatic 
one. We denote the total length of all red intervals by Rm, and the total length 
of all the blue intervals by An interval is called a key interval if both of its 
endpoints are keypoints. 

The important thing to notice is that at the end of the game the length of 
each bichromatic interval is divided equally among the two players, so TZ — B = 
Rm — Bm and Red wins if and only if Rm > Bm- We devise our strategy to force 
this to happen. 

Since we can parameterize the circle arbitrarily, we can assume without loss 
of generality that Blue plays his first point on 0 and thus on a keypoint. We now 
describe Red’s winning strategy. Figure 1 shows an example. 

Red’s Keypoint Strategy 

Stage I: If there is an empty keypoint then Red plays onto the keypoint. 
Stage I ends after the last keypoint is played (by either Red or Blue). 

Stage II: If there is no empty keypoint and it is not Red’s last move 
then Red plays her point into a largest blue interval. We call this 
breaking the blue interval. 

Stage II ends when Blue plays his last point. 

Stage III: Red’s last move. There are two possibilities: 

(i) if there exists more than one blue interval then Red breaks a 
largest one by placing her point inside. 

(ii) if there is only one blue interval define £ < ^ to be its length. 
Red’s move is to go to a bichromatic key interval and claim a 
red interval of length larger than £ by placing a red point closer 
than ^ — i to the blue endpoint of the bichromatic key interval. 

The two following lemmas will be needed. 

Lemma 1. Let B be a set of b blue points and let R be a set of r red points 
currently on the circle with b > r. Let n{R) be the number of red intervals they 
form and n{B) the number of blue ones. Then n{B) — n{R) = b — r. 
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Stage II 




Fig. 1. There are four points to be played for both Blue and Red. The white 
dots represent Blue’s points and the black dots represent Red’s points. We label 
the dots in chronological order. 

Proof. The proof will be by induction on r. If r = 0 then b blue points form b 
blue intervals so n{B) = b, n{R) = 0 and the condition n(B) — n{R) = b — r is 
satisfied. 

Now suppose that the lemma is true for all configurations of b blue points 
and r — 1 red ones. Deleting any red point p from R leaves b blue and r — 1 red 
points, so n{B) — n{R — {p}) = b — r + 1. We now add the red point p back into 
the configuration and ask how the monochromatic intervals can change. There 
are three possible placements of p : 

(i) inside a red interval, increasing n{R) by one and leaving n{B) unchanged. 

(ii) inside a blue interval, decreasing n{B) by one and leaving n(i?) un- 
changed. 

(iii) inside a dichromatic interval, increasing n{R) by one and leaving n{B) 
unchanged. 

After all three of these cases we find that n{B) — n{R) = n{B) — n{R — {p}) — 
1 = 6 — r. 

Lemma 2. Suppose that all n keypoints are covered and Blue has just moved 
(possibly covering the last keypoint). If there is only one blue interval and this 
interval has length < then there exists a bichromatic key interval. 

Proof. We apply the pigeon hole principle: At most 2n — 1 points have been 
played, n of them on keypoints. Consider the n circle arcs of length ^ formed 
by the n keypoints. Since the blue interval has length < ^, at least one of its 
endpoints is inside an arc. That leaves only n — 2 points to have been played 
inside the n — 1 remaining arcs. Therefore, one of the arcs must be free of points, 
forming a key interval. Since there is only one blue interval, there is no red 
interval by Lemma 1. Therefore, this key interval is bichromatic. 

Theorem 1. The keypoint strategy is a well-defined winning strategy for Red. 

Proof. We start with a simple observation. Since the circle contains only n key- 
points and Blue’s first move covers the first keypoint. Red will play onto at most 
n — 1 keypoints. Thus Stage I always ends before Red plays her last point. 
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Consider Stage II. Lemma I implies that after each play by Blue (& = r + 1) 
there is always at least one blue interval on the circle, so Stage II of the strategy 
is indeed well defined. 

We make two observations concerning the situation after Stage II, when Red 
has played her n — I’st point. The first is that there is no blue key interval. Let 
k be the number of keypoints played by Blue during the game. Red has covered 
the remaining n — k keypoints by the end of Stage I. If fc = 1 (the only case in 
which Red skips Stage II), then there certainly is no blue key interval as there 
is only one blue keypoint. When fc > 1, Blue can define at most k — 1 blue key 
intervals with its k keypoints (since Red has at least one keypoint). Note that 
since all keypoints are played by the end of Stage I, all intervals in Stages II 
and III have length at most In particular a blue key interval is longer than 
any other blue interval. Since Red plays k — I points in Stage II, all blue key 
intervals are broken during Stage II. 

The second observation concerning the situation after stage II is that all red 
intervals are key intervals. This statement is true at the end of Stage I, as Red has 
so far only played onto keypoints, and all keypoints are covered. During Stage II, 
Red uses her points to break blue intervals, and therefore creates bichromatic 
intervals only. Blue cannot create red intervals, and so, at the end of Stage II, 
all red intervals are indeed key intervals. 

We now show why Stage III is well defined and why Red wins. Suppose that 
Blue has just played his last point and it is now time for Stage III, Red’s last 
move. From Lemma 1 we know that n{B) > 1. 

If n{B) > I before Red’s last move then the strategy is well defined: Red 
breaks a largest blue interval. This decreases n{B) by 1 so the game ends with 
n{B) > 1. By Lemma 1 we have n(i?) = n{B) > 1. But now note that from 
the observations in the preceding paragraphs all existing red intervals are key 
intervals while all existing blue intervals have length strictly less than Since 
all red intervals are longer than all blue intervals and there are the same number 
of red ones and blue ones we find that Rm > Bm and Red wins. 

If n{B) = I before Red’s last move the strategy requires that the unique blue 
interval has length < ^, and that there exists a bichromatic key interval. The 
first fact was already observed above, the second fact follows from Lemma 2. 

After Red places her last point Blue still has one blue interval of length £ 
while Red has one red interval of length > £. Thus Rm > £ = Bm and Red wins. 

3 A Modified Circle Game 

The basic game can be modified in many different ways. The simplest modifica- 
tion allows the players to play more than one point at a time. More complicated 
modifications permit the players (both or one) to choose before each turn, how 
many points they play. 

Suppose that there are k < n rounds. Let /3i and be the numbers of points 
that Blue and Red play respectively in round i. Suppose that the following 
restrictions are placed. 
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- VI < i < fc, /3j,7i > 0. 

- VI < j < fc, Y.l=i Pi > ELi 7*- 
-El^P^ = El^l^ = n. 

- Pi < n. 

Then Red still wins by following exactly the same strategy as in the previous 
section of first filling in the keypoints and then breaking the largest blue intervals 
until Red plays it’s last point when it follows the Stage III rules. The proof that 
the strategy is well defined and wins is almost exactly the same as the one in 
the previous section so we will not repeat it here. 

Note that this generalization includes both the original game and the “batched” 
version in which each player plays the same number (> 1) of points at each turn. 
Note, too, that k, Pi, and 7^ need not be fixed in advance. For example. Blue 
may decide at every move how many points he will play and then Red plays the 
same number. 

We conclude this section by noting that the condition Pi < n \s essential 
since otherwise Blue would play the keypoints, forcing a tie. 

4 The Line Segment Version 

We now move on to the version of the game played on a line segment. We 
consider it to be horizontal and parameterized as [0, 1]. The scoring is the same 
as in the basic circle game except that the player with the leftmost point claims 
everything between 0 and the point, and the player with the rightmost point 
claims everything between the point and 1. We assume that n > 1, and points 
cannot lie upon each other. When n = 1, Blue wins by placing onto 

We modify some of the old definitions and introduce new ones: 

Definition 2. The n points Ui = ^ + 7 , f = 0,l,...,n— 1 are keypoints. 

The left segment is the segment from 0 to the leftmost red or blue point. The 
right segment is the segment from the rightmost red or blue point to 1. The border 
interval is the union of the left and right segments. An interval is a section of 
the line segment with red/blue endpoints and no red/blue points in its interior. 
We consider the border interval an interval. An interval, including the border 
interval, is monochromatic if its endpoints have the same color, and bichromatic 
if they have different colors. With this definition of intervals. Lemmas 1 and 2 
are true for the line segment as well. 

We denote the total length of all of the blue intervals — including, if appropri- 
ate, the border interval — by Bm, the total length of all of the red intervals — again 
including, if appropriate, the border interval — by Rm- When the border interval 
is bichromatic, we use to denote the length of the left/right segment with 
a blue endpoint and Rb to denote the length of the left/right segment with 
a red endpoint. If the border interval is monochromatic, then Bb = Rt = 0. 
Since all bichromatic non-border intervals are equally shared by both players 
TZ — B = {Rm + Rb) — {Bm + Bb) and, as in Section 2, we design our strategy so 
that Red finishes with the right hand side of the equation > 0. 
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We now introduce the line strategy, a modified version of the circle strategy. 

Figure 2 shows an example. 

Red’s Line strategy 

Stage I: If there is an empty keypoint then Red plays the keypoint. If 
uo or u„-i have not yet been played then Red should play onto one 
of them first. Stage I ends after the last keypoint is played by either 
Red or Blue. Note that the game may finish in Stage I. 

Stage II: If there is no empty keypoint and it is not Red’s last move 
then 

(i) if there exists at least one blue non-border interval, then Red 
should break a largest blue non-border interval by placing her 
point inside. 

(ii) if the border interval is the only blue interval, then there are two 
possible cases: 

(a) One of the blue endpoints of the blue border interval is a 
keypoint: 

Without loss of generality assume that it is uq (the other case 
is symmetric) and the other endpoint is 1 — £. From Stage I 
the other endpoint cannot be the keypoint u„-i so £ < 

Red now places her new point at x where x is anywhere in 
(£,uo). 

(b) Neither of the endpoints of the border interval are keypoints: 

Let £ be the length of the blue border interval; £ < ^- There 
must then exist a dichromatic key interval (Lemma 2). Red 
places her new point in that interval to form a new red in- 
terval of length > £. 

Stage II ends after Blue plays his last point. 

Stage III: If Red is placing her last point, we have two mutually exclu- 
sive cases: 

(i) if there exists more than one blue interval, then Red should break 
a largest non-border one. 

(ii) if there exists only one blue interval, then let its length be £; we 
will see below that £ < ^. Red should go to a bichromatic key 
interval (one will exist from Lemma 2) and claim a red interval 
of length > £ as follows. 

• If the bichromatic key interval is not the border one. Red 
can do this by creating a new red interval of length > £. 

• If the bichromatic key interval is the border one, then Red 
already possesses ^ of it because it has all of either [0, uq] 
or [un-i, 1]. Red can therefore go to the other side it does 
not possess, and grab enough length to have a red border 
interval of length > £. 

Theorem 2. The line strategy is a well-defined winning strategy for Red. 
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Fig. 2. There are four points to be played for both Blue and Red. The white 
dots represent Blue’s points and the black dots represent Red’s points. We label 
the dots in chronological order. 



Proof. Note that this strategy differs in at least one major aspect from the circle 
strategy: since we have lost circular symmetry it cannot be guaranteed that Blue 
plays onto at least one keypoint, and so it is possible that the game will end in 
Stage I, with Red playing all n keypoints. In this case, all red intervals (including, 
possibly, the border interval) are key intervals and all blue intervals have length 
< By Lemma 1 Blue and Red have the same number of monochromatic 
intervals, so Bm < Rm- If the border interval is monochromatic, then Bh = 
= 0 and Red wins. If the border interval is dichromatic, then one of its 
endpoints must be the red point uq or Un-i- This implies that Bi, < = ^, 

and Red wins. 

In what follows we may therefore assume that Blue plays onto at least one 
keypoint during the game. We will show that at the end of the game it will 
always be true that i?™ < Rm and, if the border interval is dichromatic, then 
Bb < Rb- The theorem will follow. 

First note that under this assumption Stage I always ends with all keypoints 
covered, and Stage III is reached. Note further that Red’s first move is onto 
either uq or Un-i- 

We consider Stage II. After Blue’s every move there exists at least one blue 
interval (possibly the border interval) by Lemma 1. If there is only one such blue 
interval, there is no red interval. Thus, one of the two conditions (i) or (ii) of 
Stage II holds. The strategy is clearly well defined in cases (i) and (ii)(a), the 
validity of (ii)(b) follows from Lemma 2. 

We will need one more observation. 

Lemma 3. After Blue’s last move, there is no blue key interval. 

Proof. Let k be the number of keypoints played by Blue, where 1 < k < 

I" . There are therefore at most k — 1 blue key intervals after Stage I. 

Red occupies n — k keypoints in Stage I, and so Stage II lasts for fc — 1 
rounds. This is sufficient for all blue key intervals to be broken, since a 
blue key interval is longer than any other blue interval in Stage II (i) 
and Stage II (ii)(a). 

We now prove that Red wins. There are two cases. 

Lemma 4. Assume that case (ii) of Stage II never occurs. Then all red 
intervals are key intervals after Blue’s last move. 

Proof. After Stage I all red intervals are key intervals, since Red has 
only played keypoints and all keypoints are covered. During Stage II 
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Red uses all her points to break blue intervals (since case (ii)(b) does 
not occur), and so creates only bichromatic intervals. As Blue cannot 
create red intervals, all red intervals remaining after Stage II are indeed 
key intervals. 

We examine the result of Stage III, assuming that case (ii) of Stage II did not 
occur. Assume first that Red plays case (i) of Stage III. There are then equal 
numbers of red and blue intervals left after the last move. Since all blue intervals 
have length < ^ by Lemma 3, and all red intervals are key intervals by Lemma 4, 
we have Bm < Rm- If the border interval is monochromatic, then Bh = Rh = 0 
and Red wins. If the border interval is bichromatic, then its red endpoint must 
be uo or Un-i (since Red plays case (i) of Stage II only), and so Bh < Rh = ^ 
and Red wins. 

On the other hand, assume now that Red plays case (ii) of Stage III. Then 
Blue has total length £, Red has > £, and so Red wins. 

We now consider the remaining case, where case (ii) of Stage II does occur. 

Lemma 5. Assume that case (ii) of Stage II occurs at least once. After 
Red’s last move in Stage II, there is no blue non-border interval and 

B<n. 

Proof. We prove that the statement is true after the last occurrence of 
case (ii) in Stage II, and after each subsequent move by Red. 

Consider the last occurrence of case (ii). Before Red’s move, the border 
interval is the only blue interval. 

If Red plays case (ii)(a), there is no blue interval at all after Red’s move. 

We have then Bm = Rm = 0 and B^ < Rh, which implies B <IZ. 

If Red plays case (ii) (b) , Red claims a red interval longer than the blue 
border interval. So after Red’s move we have B < TZ, as Bm < Rm and 
Bb = Rb = 0 - 

Consider now the remaining moves of Stage II. In all these moves Red 
plays case (i), and so we can deduce that Blue uses his move to cre- 
ate a new blue non-border interval. Red immediately breaks this blue 
interval. This leaves Bm, Rm, Bb, and Rb unchanged and destroys the 
only blue non-border interval. The claim therefore remains true after 
each subsequent move by Red, and in particular after Red’s last move 
in Stage II. 

We consider the situation right before Blue’s last move. By Lemma 5, there 
is no blue non-border interval, and B < n. 

If the border interval is blue before Blue’s last move and Blue uses that move 
to create a blue non-border interval move. Red plays case (i) of Stage III. This 
breaks the new blue non-border interval, returning B and TZ to their state before 
Blue’s last move, and so Red wins. 

In all other cases there is a single blue interval after Blue’s last move. It cannot 
be a key interval by Lemma 3, and so, by Lemma 2, there is a bichromatic key 
interval. Red claims a red interval longer than the blue interval, and wins. 
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5 Conclusions 

We have given strategies for one-dimensional competitive facility location, al- 
lowing the second player, Red, to win. The margin by which Red wins is very 
small, however, and in fact Blue can make it as small as he wants. Moreover, our 
arguments can easily be transformed into a strategy that allows the first player. 
Blue, to lose by an arbitrarily small margin. Do our findings have any bearing 
on the two-dimensional Voronoi Game? The concept of keypoints turned out to 
be essential to our strategies. Is there a similar concept in two dimensions? A 
natural attempt would be to define a uniform square grid of keypoints. Perhaps 
surprisingly, a player governing this grid can still lose the game by a considerable 
margin. 
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Abstract. Let F = {N, v) be a cooperative game with the player set N 
and characteristic function v : 2^ ^ R. An imputation of the game is 
in the core if no subset of players could gain advantage by splitting from 
the grand coalition of all players. It is well known that, for the linear 
production game, and the flow game, the core is always non-empty (and 
a solution in the core can be found in polynomial time). In this paper, 
we show that, given an imputation x, it is AP-complete to decide it is 
not a member of the core, in both games. The same also holds for Steiner 
tree game. In addition, for Steiner tree games, we prove that testing the 
total balacedness is AP-hard. 

Key words: cooperative game, core, network flow, linear programming, 
Steiner tree, AP-completeness. 



1 Introduction 

A cooperative game F = {N, v) consists of a player set N = {1, 2, • • • , n} and a 
characteristic function v : 2^ — >■ R. For a revenue game F, the core is defined by 

Core(P) = {x G i?” : x{N) = v{N) and x{S) > v{S), VS' C N}, 

where x{S) = study of the core are closely associated with another 

important concept, the balanced set. The collection B of subsets S of fV is 
balanced if there exists a set of positive numbers js, S £ B, such that for each i G 
N, we have XieSeB 7S = 1- ^ game {N, v) is called balanced if XseB isviS) < 
v{N) holds for every balanced collection B with weights {75 : S £ B}. With 
techniques essentially the same as the linear programming duality, Shapley [13] 
proved that a game has a non-empty core if and only if it is balanced. For a 
subset S C N, we define the induced subgame (S,vs) on S: vs{T) = v{T) for 

* Research is partially supported by a grant from the Research Grants Council of Hong 
Kong SAR (CityU 1116/99E) and a grant from CityU of Hong Kong (Project No. 
7001215). 
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every subset T of S'. A game is called totally balanced if all its subgames are 
balanced, i.e., all its subgames have non-empty cores. 

The computational complexity as a rationality measure for game theoretical 
concepts has attracted more and more attention recently. In cooperative game 
studies, work has focused on games that have a succinct representation, that is, 
the game value is defined as a solution to a combinatorial optimization probelm, 
for which the input size is polynomial in the number of players, since first sug- 
gested by Megiddo [11]. Various interesting complexity structures have started to 
emerge as a result, especially for the study of core, which will be the focus of our 
paper. Deng and Papadimitriou [4] found a game for which the core is nonempty 
if and only if a certain imputation (Shapley value in this case) is in the core. 
Faigle, et al. [5] show that, for the minimum cost spanning tree (MOST) game, 
membership testing is AP-complete. It is known, however, the MOST game has 
a non-empty core and a member in the core can be found in polynomial time 
[8]. Goemans and Skutella [7] recently showed that, for a facility location game, 
if the core is non-empty, a solution in core can be found in polynomial time, and 
membership testing can be done in polynomial time. However, it is AP-complete 
to decide if a core is not empty. In this paper, we consider the flow game and 
the linear production game. Both are known to have a non-empty core and a 
solution in the core can be found in polynomial time [9,10,12,2]. However, we 
show that, given a solution, it is AP-complete to tell it is not in the core. 

The flow game was introduced by Kalai and Zemel [9,10]. It is associated 
with the maximum flow from a source to a sink on a network, where each player 
controls one arc in the network. The value of a subset of players is the same as 
the maximum flow from the source to the sink on the subgraph induced by the 
edges controlled by that subset of players. The flow game is totally balanced, and 
on the other hand, every non-negative totally balanced game can be formulated 
as a flow game. Owen [12] introduced a linear production game in which each 
player j controls a certain resource vector V . Jointly, they maximize a linear ob- 
jective function cx, subject to the resource constraint Ax < j ^ value 
a subset S of players can achieve on their own is the maximum they can achieve 
with resources collectively owned by this subset: max{cx : Ax < 

Owen showed that the core for linear production game is always nonempty and 
one imputation in the core can be immediately obtained from his proof using 
variables of the dual linear program. Curiel [1] proved that the class of linear 
programming games is also equivalent to the class of totally balanced games. 
Therefore, the class of linear programming games is equivalent to the class of 
flow games. These reductions for the equivalence proof, however, involve in ex- 
ponential time and space in the number of players. Therefore, computational 
complexity results for these two games may not be equivalent. 

In Section 2, we first show that membership test in the core is co-AP-complete 
for flow games. Because of the special structure we have for the reduction, the re- 
sult flow game has a polynomial size linear production game formulation. There- 
fore, the same conclusion for linear production games follows as a result. In Sec- 
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tion 3, we consider the same problem for the Steiner tree game and show it is 
also NP-hard. 

In Section 4, we show it is NP-hard to decide whether a Steiner tree game 
is total balanced. Naturally, linear production game and flow game are total 
balanced. Deng, et ah, considered total balanceness of several interesting combi- 
natorial optimization games [3] . Most were shown to be polynomially decidable. 
The worst complexity is for a coloring game, for which total balancedness is 
equivalent to a graph being perfect graph. Our result is the first example of 
NP-hardness for total balanced condition. 

2 Complexity of Membership Test in the Core 
for Flow Games and Linear Production Games 

Theorem 2.1 It is NP-complete to show that, given a maximum flow game 
Fd = (E,v) on network D = {V,E]uj) and a vector x G {n = |i?|) with 
x{E) = v{E), is X not a core member of the game, i.e., does there exist a 
coalition S C E such that x{S) < v{S)7 

Proof. The problem is in NP because a coalition S C E with x{S) < 
v{S) (if it exists) can be exhibited and checked easily. To prove completeness, 
we establish a polynomial transformation from a basic iVP-complete problem, 
EXACT COVER BY 3-SETS [6]: 

Given a finite set X = {x\,X 2 , • • • , x^q} and a collection F = {/i, / 2 > • ‘ ‘ > /|f|} 
of 3-element subsets of X (|F| > q), does E contain an exact cover for X, that 
is, does there exist a subcollection F' C F such that every element of X occurs 
in exactly one member of F' ? 

For an instance of X3C, we construct a maximum flow game Jh on network 
D = {V,E-,to): 

Thus, the vertex set of I? is V = Vx 0 Vp U {s,t, vo,Wo}. (See Fig.l). 

— Vx = {vi,V 2 , ■ ■ ■ ,V 3 q}, one element-vertex Vi € Vx for each Xi € X, i = 

1.2, •••, 39 ; 

— Vf = {ui,U 2 , ■ ' ' ,u\F\}, one set-vertex Uj G Vp for each fj G F, j = 

1.2, ---,\F\; 

— source s, sink t, two additional vertices vq and Vq. 

The arc set E and the corresponding capacity function oj : E —>■ R~^ are given 
as follows: 

• ao = (s,z)o), ui{ao) = 6 q\F\; 

• bo = {vQ,t), u{bo) = 3\F\ - 3q; 

• Ai = {(uo, Ui) : i = 1,2, - , |F|}, Ve G Ai : w(e) = 3; 

• A 2 = {{ui,Vj) : Xj G fi,i = ^,2, - ■ ■ , \E\,j = 1,2,- • ■ ,3q}, Ve G A 2 : w(e) = 1 

• As = {{vj,t) : j = 1,2, • • ■ ,3q}, Ve G A 3 : w(e) = 1; 

• B = {{ui,VQ) : i = 1,2,- ■■ ,\F\}, Ve G R : w(e) = 3. 
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The arc set of D is E = {oq, 6 o}U U 2 I 2 U A 3 US, n = \E\. (See Fig.l) Consider 

maximum flow game Eu on this network D. Obviously, v{E) = 3|i^|. 

Let X G i?" be a candidate for a core member, which is given by: 



x{ao) 

x{bo) 

x{e) 

x(e) 

x{e) 

x(e) 



2q- 



1 



m 

3|F|-3g+( 

1 

3 ^( 9 +!)’ 

0, Ve G A 2 ; 

Ve G A 3 

0, Ve G B. 



1 



1 



3|i^| 3((7+1) 

Ve G Ai; 



Easily, we have x{E) = “ 3|-F| = v{E). We shall claim that x is not 

in the core of the maximum flow game Eo if and only if F contains an exact 
cover for X. 

Suppose that x is not in the core of Ed, and let S' C if be a coalition such 
that x{S) < v{S). We would like such an ‘unsatisfled’ coalition S to exhibit an 
exact cover. 

1) . S contains ao- Otherwise, x{S) > 0 = v{S), a contradiction. 

2) . S does not contain 6 g- Otherwise, if &o G S, we will discuss in two cases: 

Case 1. |S n A 3 I = A: < 3<7 — 1. 



v{S) < k + (3|F| — 3q), 

^{S) > ( 2 , - 3 ]^) + [3|f I - 39 + (^ - 3^)1 + > -M- 

Case 2. |S O A 3 I = 3q. 
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If [S' n All < |F| — 1, then v{S) < 3|F| — 3, 






’ + > v{S). 



If 15* n All = |F|, then v{S) < 3|F|, 



x(^) = (2g-i^) + |F| ^ 



I 



3|F|^ ' '3|f’|(g+I) ' ^'^■^^3|f’| 3((j + I)^^^^'^3 - 



It is contrary to the fact x{S) < v{S) in both Case I and Case 2, so bo ^ S. 
3). S contains all 3q arcs in A 3 , and v{S) = 3q. 

If 15” n A 3 I = k < 3q — 1, then 

x{S) > {2q - > fc + (^ - >k> v{S), 



which is a contradiction. So 15 fl A 3 I = 3 ( 7 . 

If v{S) < 3q, then we have v{S) < 3q — 1, since v{S) must be integer under 
the given integral arc capacities. It follows that 

a;(5) > ( 2(7 - + 3g^ > 3g - 1 > u(5), 

also a contradiction. So v{S) = 3q. 

4). S contains exactly q arcs in Ai. Suppose that |5 fl Ai| = A:, then 



x{S) = {2q- :J^) + k 



1 



3|i^r 3|i^|((7+l) 



1 1 u 

+ 3q- = 3q- -r-r(l )■ 

^3 ^ 3|FP q + r 



Since x{S) < v{S) = 3q, we have k < q+1. Again since each arc in Ai has only 
3 arcs in A 2 followed, so we must have k > q. That is, k = q. 

Therefore, under our assumption of a; ^ Core(/ii), the above discussions 
1) — 4) imply that F contains an exact cover of X. 

On the other hand, if F admits an exact 3-cover F' = - ■ ■ , fig}, let 

S = {ao} U {(wo,Uifc) : A: = 1,2, • • • ,g} U A 2 U A 3 , then 



v{S) = 3q, 
x{S) = {2q 



3|Fp + '^3|F|((7+l) 



+ q<3q, 



which implies that x is not in the core of the maximum flow game Fjy. 

Finally, easily to see that the construction of Fu and the candidate core 
member x can obviously be carried out in polynomial time. The proof is com- 
pleted. 

As a direct corollary, we come to the same conclusion for linear production 
games in Corollary 2.2, that is, the problem of checking membership in the core 
is also CO- AP-complete. 

Corollary 2.2 It is NP-complete to decide that given a linear production 
game F = (N,v) and a vector x € (n = |A^|) with x{N) = v{N), is x not 
a core member of this game, i.e., does there exist a coalition S C N such that 
x{S) < v{S)l 
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3 Checking Membership of Core for Steiner Tree Game 

Theorem 3.1 It is AT’-hard to decide that given a balanced Steiner tree 
game F{G) = (N,v) on weighted graph G = {V,E]lo) and a vector x £ 

(n = \N\) with x{N) = v{N), dose there exist a coalition S C N such that 
x{S) > v{S)7 

Proof. Given an arbitrary instance of EXACT COVER BY 3-SETS (X3C): 
a finite set X = {xi,X2, - ■ ■ ,Xn} and a collection F = {fi, f2£ ' ' , f\F\} of 3- 
element subsets of A {q> 2, |F| > 2q). We construct a Steiner tree game F{G) 
on weighted graph G = {V, E; oj) and a candidate vector x for a core member. 
The vertex set V of G consists of four parts: 

• Nx = {vi,V2, - ‘ ' , ’Vsq}, the vertex Vi £ Nx is corresponding to the element 
Xi £ X , i = 1,2, - ■ ■ , 3q, Nx represents a subset of consumers; 

• Np = {ui,U2, ■ ■ ■ , W|_F|}, the vertex Uj € Np is corresponding to the subset 
fj £ F, j = l,2p ■■ ,\F\, Np also represents a subset of consumers; 

• gi and g2 are two additional vertices, they represent switches; 

• vq represents central supplier. 

Thus, V = Nx U Np U {(71,(72, vo}. Let N = Nx U Np, n = |fV|. (See Fig. 2). 

The edge set E of G and edge weight function to : E ^ are defined as 
follows: 

• oo = (uo, gi), w(oo) = 2g - 1; 

•bo =(ffi,ff2), w(6o)=<7+1; 

• El = {{gi,Ui) : i = 1, 2, • • • , |F|}, Ve G : uj{e) = q+1; 

• E2 = {{g2,Ui) : i = 1,2, • • • , jF|}, Ve G E2 : w(e) = q; 

• Eo = {{ui,Vj) : Xj £ fi,i = 1,2,- \F\,j = 1,2, - •• ,3g}, 

Ve G Eo : (^(c) = 2q -t- 1; 

The edge set E = {00,60} U U E2 U E3. (See Fig. 2). Denote the Steiner tree 
game on the weighted graph G by E{G) = (N,v). 

Our construction of graph G is much similar to that in Faigle, et al.[5]. The 
only difference is the weights of edges in E^. The weights of edges in E3 is 
sufficient large so that each vertex in Nx does not serves as a switch in the 
minimum Steiner tree of G. For any coalition S Q N, denote the minimum 
Steiner tree w.r.t. S U {uoj by Ts = {Vs, Es). We have 

1) . If Iks O Np\ < (7 -I- 1, then Es C\ E2 = 0, and 

u(^) = u{Es) = {2q - 1) + (g + l)|k5 nv^l + (2(7 + 1)|5 n Ax|. (3.1) 

2) . If Iks O Np\ > (7 -I- 1, then Es (1 Ei = 0, and 

u(5) = cu{Es) = (2(7 - 1) + (,7 + 1) + q\Vs O Np\ + {2q + l)|5n iVx|. (3.2) 

Therefore, for the grand coalition N, v{N) = 6q{q -I- 1) -I- q\F\. Let 

x{ui) = q+ Vt= 1,2,---,|F|; 



Vj = 1,2,...,3(7; 
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Np {2q + 1) Nx 




Nf ^ {ui : i = 1, 2, • • • , |F|}; Nx = {vj : j = 1, 2, • • • , 3g}. 
(The characters in brackets are the corresponding edge weights) 

Fig. 2. G=(y,E;u). 



easily to verify that x G Core(r(G)), this game is balanced. 

Define x G i?" be a candidate for a core member: 

x{ui) = q, Vi= 1,2,---,|F|; 
x{vj) = 2q + 2, Vj = 1,2, 

We shall claim that x is not in Core(F(G)) if and only if F contains an exact 
cover for X. 

Assume x is not in Core(F(G)), and let S' C if be a coalition satisfying 
x{S) > v{S). Then S must possess the following properties: 1). S fl Nx yf 0; 2). 
\S n Np\ = q.; 3). S fl Nx = Sq- Therefore, the assumption of a: ^ Core(T(G)) 
implies F contains an exact cover of X. 

On the other hand, if F admits an exact 3-cover F' = {/ii, /ia, • • • > fig}, let 
S = {uii^ : k = 1, 2, • • • , g} U Nx, then v{S) = 7q^ -I- fig — 1 < 7q^ + 6q = x{S), 
which implies that x is not in Core(T'(G)). Easily to see that our construction of 
game F(G) and the candidate core member x can be carried out in polynomial 
time, so the proof is finished. 

4 A AP-Hard Total Balancedness Condition 

In this section, we prove the problem of testing total balancedness of the Steiner 
tree game is AP-hard. In our proof, we will use 3-PARTITION, which is shown 
to be AP-complete in strong sense in the book of Garey and Johnson [fi]. 

Theorem 4.1 Testing total balancedness for Steiner tree games is NP-hard. 
Proof. Given any instance of 3-PARTITION: A finite set = {oi, 02 , ..., 03 ™}, 
a bound B G and a size : s(a) G for each a G A, such that 

f ^ < s(a) < G A, 

\ EaGA s(a) = 
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We let 5 = {S' : S' is a 3-set of A}, p = |S| = and Q = (Q : Q is a (3m — 
l)-set of A}, q=\Q\= 3m. 




(The quantities in the brackets are the corresponding edge weights.) 



Fig. 3. G= {V,E]Uj) 



I. First we construct a weighted graph G = (F, E; oj). 

• The node set V = {vq} A N U M is given as follows: 

{uo} is corresponding to the supplier; 

^ is corresponding to the set of consumers; 

M = SUQU{uo} is corresponding to the set of switches. 

• The edge set E = EiA E 2 A E^ A E/^ A E^, and the edge weight function 
uj : E ^ are defined as follows: 

El = {(a, S) : a € S, for all a € A,S € 5}, Ve € Ei, uj{e) = 6mB; 

E 2 = {(a, Q) : a G Q, for all a G A,Q G Q}, Ve € E 2 , w(e) = 6 mB + 

Es = {{S,uo)-.SgS}, 



Ve = (S', Mo) G E 3 , w(e) 



B if EaGS S(a) < 

¥ if EaGS'S(o) > B; 



Ea = {(<5, Mo) : Q G Q}, Ve e E 4 , w(e) = niB - f ; 
-E5 = |(mo,mo)}, uj{uo,vo) = 5mB. 



Denote the corresponding Steiner tree game by E(G) = {N,v). For conve- 
nience of comprehension, we give a sketch of the graph G = {V,E;lo) in Fig. 3. 
Notice that 

1). The weight of each edge e G Ei A E2 is sufficient large so that in any 
minimum Steiner tree of G w.r.t. S U |mo} (S C N), there are exactly |S| edges 
in El U i? 2 . 
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2). The construction of edge (uo,vq) is to ensure that for any proper subset 
of A, the vertices in S are not needed in the corresponding minimum Steiner 
tree. 

II. We show that the cores of all proper subgames of T(G) are non-empty. 
Let S be any proper subset of N, 151 < 3m — 1. Assume that Ts = (Vs,Es) 
is the minimum Steiner tree of G w.r.t. S U {uo}) then it must be the case 
Vs = {w} U 5 U Q, where the correponding set Q G Q including S. That is, Es 
consists of one edge in E^ and |5| edges in E 2 , 



v{S) = QmB\S\ + +mB-\B := uj. 

o o 



(4.1) 



Let 



5B {3m-l)B 






Vo G S, 



easy to verify that it is an element in the core of E{Gs)- 

III. We prove that the game E{G) has non-empty core if and only if A has a 
3-partition. 

Assume that T = (Vat, Em) is the minimum Steiner tree of G w.r.t. iVU{uo}, 
then we have 

Vn = {wo} U A^ U {5i, • • • , 5m}, 



where 5i, • • • , Sm G S and IJ™ ^ Si = A. That is, Em consists of the edge (uq, vq), 
m edges in A 3 and 3m edges in E\, and 



3B 13 

v{N) < 6mB ■ 3m + + ^mB = ISm^B + —mB := Vm- 



(4.2) 



In fact, 

a) . If Em doesnot contain any edges in A 1 UA 3 UA 5 , then it must consist of 3m 
edges in E2 and two edges in E4, and v{N) = 2{mB — ^) + 3m{6mB + ^) > v%, 
a contradiction. 

b) . If Em contains k {0 < k < 3m) edges in E2, then it must contain one edge 

in A 4 , and v{N) > {mB— E) + k{6mB+^)+5mB+{3m—k)6mB+ \B > 

Vm, also a contradiction. 

If A has a 3-partition, then 



v{N) = 6mB ■ 3m + mB + 5mB = ISm^B + 6mB. 



Let x{a) = 6mB + 2B, Va G N. By formula (4.1), easy to verify that it is a core 
member of E{G). 

If A has no 3-partition, suppose the core of E{G) is not empty and x G 
Core(A(G)). Since x{N) = v{N) > -|- 6mB, there must be a (3m — 1)- 

element subset Q* of N such that 

‘^771 1 

x(Q*) > — (ISm^i? -I- 6mB) = (3m — l)(6mB + 2B). 

3m 

By the formula (4.1), we have 

v(Q*) = 6mB(3m—l) + ^B(3m — l)+mB—— = (3m—l)(6mB + 2B) < x(Q*), 
o o 
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which is contrary to our hypothesis that x G Core(I^(G)). 

Therefore, the T(G) is totally balanced if and only if A has an equivalent 

3-partition. Also the construction of T'(G) can be carried out in polynomial time, 

so testing total balancedness of Steiner tree games is AP-hard. 
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Abstract. This work proposes two identification algorithms based on 
some difficult problems. The first scheme is an extremely simple combi- 
nation of the idea of one-time passwords and the use of public keys. The 
second proposal allows identification to be implemented without leaking 
any secret information during the interaction because it combines the 
two mentioned concepts with a zero-knowledge proof of the possession 
of a solution to a difficult graph problem through a challenge-response 
technique. 



1 Introduction 

Identification schemes are designed to allow one party (the verifier) to gain assur- 
ance that the identity of another (the prover) is as declared. The most common 
technique is the use of secret time-invariant passwords that are shared between 
both participants. This scheme corresponds to a maximum-disclosure proof and 
its major security concern is eavesdropping and subsequent replay of the secret 
information. There are two well-known solutions to this security problem. The 
weakest of both methods uses one-time shared passwords, whereas the strongest 
solution consists in applying the idea of challenge-response protocols that allow 
the prover A (Alice) to convince the verifier B (the system) that she has some 
information without revealing it. This idea corresponds to minimum-disclosure 
proofs and has been implemented through several techniques as for example 
zero-knowledge proofs. This last concept was formalized in 1986 by Goldreich, 
Micali and Wigderson, [3], who showed that any problem in NP can be given 
a zero-knowledge proof. This kind of proofs implies zero disclosure, the use of 
random numbers as challenges and the application of bit commitment schemes. 
The recent reference [2] provides an extensive review to the area of probabilistic 
proofs. There we can find assertions such as: ’’The development of techniques 
for the construction of efficient zero-knowledge proof systems is still of interest” , 
which refers exactly to the subject of this work. 

The work [4] proposes a zero-knowledge proof for the solution to the graph 
isomorphism problem. Here we use the same problem as basis for the demon- 
stration of the knowledge of a solution to other difficult problems, [1] . Note also 
that in our proposals there is no need to know how to solve any difficult problem 
because the user builds an adequate graph for each chosen solution. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 257-261, 2001. 

© Springer- Verlag Berlin Heidelberg 2001 
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This paper is organized as follows: In the following section we comment the 
main ingredients of our protocols. Sections 3 and 4 present respectively a sim- 
ple protocol based on one-time passwords and a scheme based on the challenge- 
response technique. Finally the work ends with some conclusions and open prob- 
lems. 

2 Ingredients 

The proposed schemes combine challenge-response techniques with one-time 
passwords. Now we comment some of the main aspects of these ingredients. 

2.1 Challenge-Response 

According to the challenge-response technique, the steps followed by the partic- 
ipants A and B can be generally described in the following way: 

1. A chooses a random element (witness) of a predefined set, which defines a 
set of questions (challenges) whose answers (responses) only she knows. 

2. B chooses one of the challenges. 

3. A provides the corresponding response. 

4. B verifies the received response and witness. 

If the number of challenges is c, then the probability of A’s fraud after m 
iterations is c”™. Thus, to decrease A’s fraud probability, the number m of 
iterations should be increased. Also, to protect the secret information, A should 
answer, at the most, to one question for each witness, and never reuse any 
previous witness. On the other hand, formal zero-knowledge proofs should verify 
the properties of completeness, soundness and zero-knowledge, [2]. 

2.2 Graphs 

Another important ingredient for these protocols is the difficult graph problem 
used as basis to guarantee their security. In order to make more efficient the 
verification phase, this problem should be ’’one-way”, that is to say, it should 
be such that it is difficult to find a solution but however it is very simple to 
verify a solution. In these proposals the user has to follow a building process 
for the construction of an adequate graph from a randomly chosen solution. 
This process has a different computational complexity depending on the specific 
graph problem, so this aspect has to be also considered when selecting the graph 
problem. So, it seems that the most suitable difficult graph problems are those 
where the solution is a vertex subset, such as the vertex cover, the independent 
set or the clique problems. Regarding the building process, in both protocols, 
once chosen a problem P, a number of vertices n and a solution S, the user A 
has to build a graph G where S is a solution to the problem P. In order to do it, 
A can apply one of the two following methods: 

CM) Constructive method: By adding edges to the null graph N„ 

DM) Destructive method: By deleting edges from the complete graph K„ 
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2.3 Representations 

Throughout this work, every secret identification password corresponds to a so- 
lution of a difficult graph problem. So, regarding memory requirements, special 
attention should be paid to the solutions, graphs, functions and problems repre- 
sentations. Every solution is here symbolized by the decimal value of the binary 
representation where the unitary bits indicate the presence of the corresponding 
elements of the solution, so in the first protocol, since this is the only object 
that has to be sent from A to B, the communication complexity is not a great 
problem because it only implies to send a binary vector whose length depends 
on the kind of the used problem. On the other hand, the problems have to be 
published in a certified data base. So, a code dictionary for the list of possible 
problems can be published in the same directory allowing the use of a short code 
corresponding to each problem. 



3 Non-probabilistic Protocol 

Some of the main ideas of the scheme proposed in the following section are 
introduced now with the definition of the first extremely simple proposal based 
on one-time passwords. The description of this protocol is as follows: 

Precalculation 

1. A chooses a graph problem P and an adequate solution, and states the 
integer S corresponding to this solution 

2. A builds a corresponding graph G for P and S 

3. A states as her public key the pair (P,G) 

Interaction 

One-time secret password: A positive integer S 

One-time public information: A graph G and a problem P 

1. A sends to B the solution S 

2. B checks that S is a valid solution according to A’s public key (P,G) 

This scheme is non probabilistic, so after applying it, there should be no 

doubt about the identity of the user A thanks to the difficulty of the problem 
and the well choice of the parameters. Another advantage of this proposal is the 
no need to share any secret information between both parties, in contrast with 
other well-known one-time password schemes. Note that with this protocol, once 
used a solution S, it looses its secret condition, so it is necessary to update it 
after each identification process. So, during the precalculation phase, A has to 
carry a building step that is completely independent from previous performances 
of the identification scheme because the solution changes in every running. In 
order to make more efficient this building step, in the following section we will 
propose a probabilistic proof where the user A can take approach of previous 
performances to build new graphs. 
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4 Probabilistic Protocol 

In this section we present a general interactive challenge-response scheme based 
on difficult graph problems. Now an informal description is given prior to the 
formal definition. The protocol can be divided into two parts. The first part is an 
off-line precalculation of the prover where she chooses her fixed secret identifica- 
tion number S, and her one-time public information: a problem P and a graph 

G. The second part corresponds to the zero-knowledge procedure composed of m 
iterations where A commits to certain information, B chooses a challenge, A re- 
sponses and B verifies the answer and the commitment, so that finally it accepts 
that A possesses a valid secret identification number. So, the formal description 
of the algorithm is as follows: 

Precalculation 

1 . A chooses a graph problem P and a suitable solution, and states the integer 

5 corresponding to this solution 

2. A builds an adequate graph G where S represents a solution to P 

3. A updates the pair (P,G) in the public data base 

Interaction 

Fixed secret information: A positive integer S 

One-time public information: A graph G and a problem P 

The following four steps are executed m times: 

1. A generates at random an isomorphic copy H of G, and sends it to B 
jointly with the cipher of both, the isomorphism f and the solution T=f(S) to 
the problem P in H 

2. B obtains from the public data base the information (P,G) associated to 
the user A and selects randomly a bit b. Depending on its value, it requests her 
to decipher: 

(a) ^ a characteristic question of the solution T, if b=0 

(b) the isomorphism f, if b=l 

3. A responses to the challenge 

4. Depending on the selected challenge, B verifies the corresponding cipher 
and checks that: 

(a) T fulfills the properties of a valid solution to the problem P in the graph 

H, if b=0 

(b) f transforms the graph G in its isomorphic copy H, if b=l 

In this scheme there exists certain probability that a fraudulent prover passes 
the verification. On the other hand, the proof of its completeness, soundness 
and computational zero- knowledge properties is easily affordable. Note also that 
with this scheme the solution S does not loose its secret condition, so it is not 
necessary to change it after each running and there is no need to share any 
secret information. Furthermore, this probabilistic proof has the advantage of 
requiring less off-line operations than the previous proposal, provided that the 
building process can be implemented in a way that for one fixed solution and 

^ The step 2(a) can be replaced by the total disclosure of the solution T when the 
knowledge of its cardinality does not help to deduce S. 
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for each different problem, the generation of every graph can take approach of 
previous generations, or even though it can be easily carried out in parallel. 
Finally, an important aspect of this protocol is that it requires the assumption 
of the following hypothesis: 

’’The knowledge of different pairs of difficult problems and corresponding 
graphs for which there exists a common solution does not help to solve any of 
the graph problems”. 

5 Conclusions and Open Problems 

The primary aim of this work is the design of cryptographic protocols in order 
to solve the identification problem. The schemes here proposed constitute two 
elegant models that allow the application of many different problems from the 
Graph Theory. They are the result of the combination of the idea of one-time 
passwords with public keys in the first case, and commitment schemes with 
challenge-response in the second. Also the NP-completeness is suggested to be 
used here with a positive use as a basis for both schemes. 

Some questions about the use of the protocols to be recalled are the following: 

a) the usefulness of these schemes relies on the great variety of graph problems 
that can be used as basis for the application of both protocols. 

b) the basis problem should be ’’one-way”, that is to say, such that it is 
difficult to find a solution but however it is extremely simple to verify a solution. 

A forthcoming version of this work will include the comparison among the 
computational efficiency of the algorithms, the search of the most suitable in- 
stance graphs and problems, the analysis of minimal assumptions for the imple- 
mentations and the study of the possible parallelization of the second proposal. 
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Abstract. This paper presents a new exponentiation architecture and 
multiplier/squarer for GF(2'”), which uses a standard basis representation. The 
proposed multiplier/squarer is used as kernel architecture of exponentiation. 
Although the proposed multiplier/squarer computes the multiplication and 
squaring operations at the same time in GF(2“), the common parts existing in 
both operations are only executed once, thereby reducing the required hardware 
compared to related systolic circuits. The proposed multiplier/squarer can be 
easily applied to exponentiation architecture. It is also well suited to VLSI 
implementation because of its regularity, modularity, and unidirectional data 
flow. 



1 Introduction 

Finite fields GF(2"‘) have found useful in error-correcting codes and cryptography 
[1][2], however, these applications usually require the computation of multiplications, 
squares, and exponentiations in GF(2"‘). Modular exponentiation is the core 
computation of numerous public key cryptosystems including RSA. As such, to satisfy 
real-time requirements, efficient hardware structures for such operations are desirable. 

The conventional method for computing squares in GF(2'") over a standard basis 
involves a multiplier. Recently, a low latency squarer for GF(2'") with bi-directional 
data flow over a standard basis was proposed which involves less chip area and a 
smaller latency than using a multiplier in GF(2'"). For systolic array implementations 
of exponentiation architecture for GF(2"‘), the architectures in [3] and [8] can be used. 
The circuit in [3] consists of the 2{m-\) multipliers in [5]; the circuit in [4] requires 
the m-\ power-sum circuits. The architecture in [3] involves bi-directional data flow, 
while those in [6] and [7] involve unidirectional data flow. 

This paper proposes a new systolic multiplier/squarer and new systolic 
exponentiation architecture for GF(2"‘) with unidirectional data flow over a standard 
basis. Both circuits involve a smaller latency and less hardware complexity than 
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existing related systems, plus they are well suited to VLSI implementation with fault- 
tolerant design. 



2 Exponentiation Architecture for GF(2'") 

Unlike addition and multiplication, exponentiation in GF(2"‘) is very similar to an 
operation for ordinary integers. Let^(x) and B{x) be elements in GF(2'"), and P{x) be 
the primitive irreducible polynomial of degree m [5]. The three polynomials A(x), 
B{x)anA P(x) are written as 



m~\ m~\ m 

= = P(x) = Y,Pkx\ P„=i 

k=l k=\ k=l 

If e = e„_ 2 , ..., eo] is the exponent, a right-to-left algorithm can be performed 

to compute A(xy mod P(x) over GF(2"‘). Ordinary right-to-left algorithm in [8] can be 
slightly changed to Alg. 1, which uses a new multiplication/squaring algorithm, M/S 
algorithm, as a sub-ftmction for exponentiation algorithm. M/S algorithm will be 
described in section 3. 

[Algorithm 1] Exponentiation Algorithm using M/S Alg. 

Input A{x) , e, P{x) 

Output E{x)=A{x) mod P(x) 

Step 1 E{x)=l, T{x)=A{x) 

Step 2 for 2=0 to m-1 do 

Step 3 if e,= = l (e‘"’ ( x) , t'"’ ( x) ) =M/S (x) , t'"'” (x) ) 

else (e‘*’ (x) =e'"'“ (x) , r‘*’ (x) ) =M/S (e'"" (x) , t‘""’ (x) ) 

In step 3, M/S algorithm is called with two parameters and and it 

returns two computation results of multiplication, E^‘~^\x)*'f ‘'^\x), and squaring. 




Fig. 1. Proposed exponentiation architecture in GF(2'") 
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Square result is stored to But, the variable Ef ‘\x) stores a value 

depends on the value of Cj. Returned multiplication result is passed to E^‘\x) when U; 
has value 1, but previous result, Ef ''^\x), is stored to E^‘\x) in the other case. 

Fig. 1 shows a new sequential exponentiation architecture using a new systolic 
multiplier/squarer, which operates Alg. 1. This architecture includes a 
multiplier/squarer as its kernel architecture. The multiplier/squarer consists of main 
two parts, the MUL and the SQR. The MUL operates a multiplication operation 
whereas the SQR operates a square operation. IPL register keeps an irreducible 
polynomial. While the multiplier/squarer computes the multiplication operation, it also 
processes the squaring operation concurrently forming the next square term. The 
switch depicted in Fig. 1 decides the multiplication result, which chooses the previous 
result or the current result depends on the exponent. 



3 Systolic Multiplier/Squarer for GF(2") 

The computation steps for the modular multiplication M(x) = A(x)B(x) mod P(x) and 
square S(x) = A(x)A(x) mod P(x) are follows as: 

M{x) = A(x)B(x) mod P(x) 

= boA{x)+b\[A{x)x mod F’(x)]+. ..+bn,-\[A{x)x"'~^ mod PCx)] (1) 

5(x) = A(x)A(x) mod P(x) 

= aoA(x)+ai[A(x)x mod F’(v )]+. . .+a^.i[A(x)x"‘~^ mod PCx)] (2) 

In the above two equations (1) and (2), [ ] is common to both. Thus, it is much more 
efficient to compute the common parts only once when multiplication and squaring 
operations are carried out simultaneously. As such, equation (1) can be reformulated 
into the recurrence form, called LSB-first form as follows: for \<i<m 

I\^\x) = + b,_^A^'-\x), A%) = A(‘-'\x) X mod P(x) (3) 

where A*-°\x) = A(x), A/°\x) = 0, and lv^'\x) = bfyi{x) + bi[A{x) x mod PCx)] + b 2 [A{x) 
x^ mod PW] + ... + bi_i[A{x) x'"' mod PCx)]. For i^m, l\^"'\x) = M(x) = A{x)B{x) mod 
P{x). Note that the two equations in (3) can be performed in parallel. 

The above squaring operation (2) can be reformulated to the following LSB-first 
recurrence form just like equation (3) derived for multiplication: \<i<m 

^‘\x) = ^'-\x) + a,.,A^‘-\x), A%) = A(‘-'\x) x mod P(x) (4) 

where A^°\x) = A(x), S^°\x) = 0, and = aoA(x) + ai[A(x) x mod PCx)] -t a 2 [A{x) x^ 
mod PW] -t . . . -t ai_i[A(x) x'"' mod /"(x)]. For i=m, y^^^x) = 5(x) = A(x)A(x) mod P(x). 
Note that the two equations in (4) can also be performed in parallel. 

The following bit-wise LSB-first algorithm computing both modular multiplication 
and squaring simultaneously can be derived from the above equations (3) and (4): 

[Algorithm 2] Multiplication and Squaring Algorithm : 
M/S(E(x), T(x)) 

Input A (x) =T(x) = a„ ) 

B(x) =E(x) = h„ ) 
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P(x) = (l, p„_^, p„ ) 



Output 


Mix) =A{x) B{x) mod 


P(x) ,S(x) =Aix) 


A[x) mod 


Step 1 




m‘°’„) = (o, 0, ... 


, 0) 




S‘“’„) = (0, 0, ... 


, 0) 


Step 2 




a‘'’.^= 0, 0<i<m 
for 2=1 to m do 




/ — / ) 


Step 3 


for j=l to m do 






Step 4 
Step 5 
Step 6 


,=M' ' ,+i), ' , 

S'^’""=s'"^’""+a*’ a'"””" 

m-j m-j-1 £r m-j ra-1 





Fig. 2 shows a DG of Alg. 2 over GF(2"*). The node at position (1 < i,j <m,m^ 
4) performs three recurrence equations of Alg. 2 in parallel. DG with regular data 
flows can be easily transformed to a systolic array via a space-time transformation [4]. 




Fig. 2. Dependence Graph for GF(2‘') 

The time transformation vector t is equal to [2, 1] because the vector of the 
equitemporal hypetplane is [1, -2]. The minimum number of PEs to be mapped is 
when the j column, [0, 1] can be fixed to the projection vector. The space 
transformation vector S , which is vertical to the projection vector, is [1, 0]. By using 
the vector S , the nodes on the same row in Fig. 2 can be mapped to the same PE. The 
proposed systolic multiplier/squarer is shown in Fig. 3. 



4 Conclusion 

This paper have presented a new systolic multiplier/squarer computing multiplication 
and squaring operations at the same time, and a new serial-in-serial-out exponentiation 
architecture for GF(2"‘) using a standard basis representation. Although computing 
multiplication and squaring operations at the same time in GF(2'"), the proposed 
systolic squarer/multiplier requires less hardware than a dedicated systolic multiplier. 
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(a) Systolic Multiplier/Squarer over GF(2'*) 




Fig. 3. Proposed Systolic Multiplier/Squarer 



Table 1. Comparision of architectures on GF(2“) 



Item 


Mul. in Ref [6] 


Mul. in Ref [7] 


Proposed Arch. 


No. of PEs 


m 


m 


m 


Latency 


3m 


3m 


3m 


PE complexity 


3 AND gates 


2 AND gates 


3 AND gates 




2 XOR gates 


2 XOR gates 


3 XOR gates 




2 MUX(1X2) 


2 MUX(1X2) 


3 MUX(1X2) 




10 Latches 


1 1 Latches 


14 Latches 


Critical path 


1AND+2XOR+ 


1AND+1XOR+ 


1AND+1XOR+ 




lMUX+1 Latch 


IMUX+lLatch 


IMUX+lLatch 



As shown in Table 1, the PE complexity of the proposed multiplier/ squarer was 
higher than that of existing multipliers. This fact is undeniable because the proposed 
multiplier/squarer computes modular multiplication and squaring simultaneously, 
whereas the other multipliers compute only modular multiplication. The proposed 
hardware is well suited to VLSI implementation because of its regularity, modularity, 
and unidirectional data flow. 
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Abstract. We present a simple space saving trick that applies to many 
previous algorithms for transitive closure and shortest paths in dynamic 
directed graphs. In these problems, an update can change all edges inci- 
dent to a node. The basic queries on reachability and distances should 
be answered in constant time, but also paths should be produced in time 
proportional to their length. For: 

Transitive closure of Demetrescu and Italiano (FOGS 2000) 

Space reduction from 0{n^) to O(n^), preserving an amortized up- 
date time of 0{n^). 

Exact all-pairs shortest dipaths of King (FOGS 1999) 

Space reduction from 0{n^) to 0(n^\/n&), preserving an amortized 
update time of 0{n^V^), where b is the maximal edge weight. 
Approximate all-pairs shortest dipaths of King (FOGS 1999) 
Space reduction from 0{n^) to O(n^), preserving an amortized up- 
date time of 0{n^). 

Several authors (Demetrescu and Italiano, FOGS 2000, and Brown and 
King, Oberwolfach 2000) had discovered techniques to give a correspond- 
ing space reduction, but these techniques could be used to show only the 
existence of a desired dipath, and could not be used to produce the actual 
path. 



1 Introduction 

This paper is on saving space for dynamic transitive closure and shortest paths 
in dynamic directed graphs. For example, the space saving for transitive closure 
is from O(n^) to O(n^). Papers on algorithm are often more concerned with 
time than space, but a bad space complexity hits like a wall, whereas time only 
kills slowly. Ignoring for a moment the O-notation, if a computer has gigabyte 
of memory, with space, you can only run on graphs with up to 1, 000 nodes, 
whereas with space, you can deal with more than 30, 000 nodes. Our space 
savings techniques are simplifying and tend to decrease rather than increase the 
constants hidden in the O-notation, both in time and in space. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 268-277, 2001. 
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A fully dynamic graph algorithm is a data structure for a graph which imple- 
ments an on-line sequence of update operations that insert and delete edges in 
the graph and answers queries about a given property of the graph. A dynamic 
graph algorithm should process queries quickly and must perform update oper- 
ations faster than computing from scratch (as performed by the fastest “static” 
algorithm) . 

In this paper, we give dynamic algorithms for weighted directed graphs. The 
problems considered are as follows, where the queries range over all pairs of ver- 
tices u and w. 

Transitive closure: 

Reachability query: Is there a path from u to w in the current graph? 

Path query: Return a path from u to w. 

Exact all-pairs shortest paths: 

Distance query: What is the distance from u to wl 
Path query: Return a path from u to w. 

Approximate all-pairs shortest paths: 

Distance query: What is an upper bound on the distance from u to w within a 
factor of (1 -I- e)? 

Path query: Return a path from u to ic of length within a factor (1 -|- e) of the 
distance. 

The following update operations are allowed: 

— multi-insert (A.;,): inserts a set of edges incident to the same vertex v. 

— delete(e): deletes any edge currently in the graph. 

In this paper, we are only considering amortized time bounds. Deleting an edge 
is free in that it is paid for by the preceding insertion/initialization of the edge. 

We present a simple space saving trick that for all the above problems reduce 
the space substantially in the current fastest algorithm. More precisely, assuming 
that the graphs start with an empty edge set, we achieve: 

Transitive closure of Demetrescu and Italiano [3,4] Space reduction from 
0{n^) to O(n^), preserving an amortized multi-insert time of 0{n^). 

Exact all-pairs shortest dipath of King [9] Space reduction from 0{n^) to 
0{n^\fnb), preserving an amortized multi-insert time of 0{n^\/nb), where b 
the maximal edge weight. 

Approximate all-pairs shortest dipaths of King [9] Space reduction from 
O(n^) to O(n^), preserving an amortized multi-insert time of O(n^). 

The above update times should be compared with the best static algorithms. 
Transitive closure and approximate shortest paths can be computed essentially 
as fast matrix multiplication [8,10,13], which takes time, and for exact 

all pairs shortest paths, can be computed in time, again using fast 
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matrix multiplication. Thus, the amortized update times may not seem that 
impressive. However, fast matrix multiplication is considered rather slow, and 
it is an open problem to construct a combinatorial algorithm just for transitive 
closure with sub-cubic running time. Our algorithms are purely combinatorial, 
and in this view, the dynamic speed-up is optimal in that we can construct a 
graph from scratch with a multi-insert from each vertex, paying a total cost 
of n X 0{n^) = 0{n^) to construct the transitive closure, thus matching the 
best static combinatorial time bound. Our reduction of the space from 0{n^) to 
0{'n?) adds optimality with respect to space. 

Our ideas also apply to the older single source algorithms of Even and 
Shiloach [5] and of Ramalingam and Reps [11], improving their internal space 
from 0(m) to 0{n). Hence, if used for all pairs shortest paths, the improvement 
is from 0{mn) to 0{n^). We are here particularly interested in Ramalingam and 
Reps’ algorithms because it has proved efficient in practice [7,6]. Our idea also 
simplifies the implementation which typically reduces the worked performed. 

A common theme of all the above algorithms is that they keep tables of dis- 
tance information along with witnesses. A table entry can only increase if all its 
witnesses are lost. This theme goes back to Even and Shiloach’s decremental sin- 
gle shortest path algorithm [5] , and is also used in the fully-dynamic single source 
shortest path algorithm of Ramalingam and Reps [11]. Unfortunately, there may 
be many witnesses for each entry, and it is the storage of these witnesses that 
cause a prohibitive space-overhead. 

Recently space improvement similar to ours were reported by Brown and 
King [2] and of Demetrescu and Italiano [4] . Their idea was to maintain a counter 
of the number of witnesses for each entry. Again, the entry can only increase if 
its counter goes to 0. The problem in their solution is that they loose track of 
witnesses, and then they cannot answer path queries. 

Our challenge of identifying witnesses rather than just knowing their exis- 
tence is similar in spirit and motivation to that of finding witnesses for Boolean 
matrix multiplication as done by Alon et al. [1], though the techniques we apply 
here are combinatorial and completely different. 



Our contribution is to observe that if we are careful about the order in which 
we scan for witnesses, then we only need to store the first witness that we meet. 
When a witness is lost, we just continue the scan for the next witness. Our 
problem is then to identify scanning orders and properly define witnesses so 
that the same potential witnesses are not scanned repeatedly. 



Contents First, in §2, we present the simplest version of the trick, applying it 
to Even and Shiloach’s classic algorithm [5]. In §2, we also point out the modi- 
fication needed for Ramalingam and Reps’ algorithm [11]. Next, in §3, we show 
how to tailor the trick to King’s exact shortest paths and transitive closure 
algorithms [9] . For space reasons we defer the treatment of King’s exact and ap- 
proximate shortest paths and Ranmalingam and Reps shortest paths algorithm 
to the journal version of this paper. 
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Notation We working on a dynamic directed weighted graph G = (E, V) with 
edge weight function £ : E ^ N. For any subgraph H of G, V (H) is its vertex 
set and E{H) its edge set. For any pair of vertices {v,w), dist{v,w) denotes 
their distance in G. For each vertex v, in(v) and out{v) denotes its incoming 
and outgoing edges in G. 

2 Even and Shiloach’s Algorithm 

Our basic idea is most easily presented in terms of Even and Shiloach’s decre- 
mental single shortest path algorithm [5]. Their result is that we can maintain 
distances up to some threshold A in 0{mA) total time. In [5] they just consider 
unweighted graphs and threshold distance A = n, giving them an 0(mn) bound 
for maintaining all distances from the source. We generalize this to weighted 
graphs. 

For a given source node s, the Even-Shiloach algorithm maintains a shortest 
path graph Els which is the union of all shortest paths from s of length at most A. 
For each vertex v, we maintain its distance ds{v) from s, and, for each distance 
t = 0, ..., A, we maintain the set of vertices v with ds{v) = i. During our response 
to an edge deletion, ds{v) will be incremented by one at a time until it reaches 
the new distance from s to v. 

In the shortest path graph Hs, each edge (u,v) is a witness of the current 
distance to u, in the sense that dist{s,v) = ds{v) = ds{u) + £{u,v) = dist{s,u) + 
£{u,v). Hence, when an edge (u,v) is deleted, it affects distances only if (u,v) is 
in Els, and there is no other edge (x, v) to v in Hs- 

Suppose the deleted edge {u, v) is a last witness to v in ffg. This implies that 
ds(v) has increased, and the outgoing edges (v,w) are no longer valid witnesses 
of ds{w). A cascading effect is started: when a node w loses all its incoming edges 
in Hs, we must delete all its outgoing edges in Hs- 

To update Hs efficiently, the cascade is performed one distance i at the 
time. Let (u,v) be the original edge deleted. Starting with i ^ ds{v), we do 
as follows. For each vertex w with ds{w) = i and no incoming edges in Hs, we 
remove all its outgoing edges in Hs, and increment ds(w) by one. This preserves 
ds(w) < dist{s,w), and now, for all u with ds{u) < i, we know that ds{u) = 
dist{s,u). Next, for each w with ds{w) increased, we consider all its incoming 
edges (x,w) in G. The edge (x,w) witnesses ds{w) = dist{s,w) if ds{x) < i and 
ds{x) + £{x,v) = ds{w), and if so, (x,w) is added to Hs- If no such witness is 
found, we still have ds{w) < dist{s,v). We now set i ^ i + 1 and repeat. We 
terminate when t = Z\ + 1 or when we get to a distance i where all nodes have 
incoming edges in Hs- 

For a detailed description and analysis of the above algorithm, the reader 
is referred to [5]. The running time is dominated by the scanning for witnesses; 
each time we increment ds{w), we scan all its incoming edge to see if they witness 
the new distance. Since this can happen at most A times for each w, the total 
cost is 0{A ■ E»gv(G) ^ ^(G)}|) = 0(Am). 
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2.1 Our Space Saving Trick 

The internal space of the Even-Shiloach algorithm, i.e. the space beyond the 
input graph G, is 0{m) for storing the shortest path graph Hg- We will reduce 
this to 0{n). If we want a decremental data structure for each node, then the 
space is reduced from 0{mn) to 0{n^). 

For each vertex v, its list in{v) of incoming edges in G stores the incoming 
edges in some order. Instead of storing all of Hg, for each v, we store only the 
edge (x,v) from Hg which is first in in{v). We then have the unique shortest 
path tree Tg from s, spanning all nodes within distance A and such that the 
parent pointer of each node v is the first edge in in{v) which is on a shortest 
path from the source to v. 

The deletion of an edge {u,v) only has a consequence if {u,v) is in Tg. In 
that case, we search for a replacement witness. Starting from the successor of 
(u,v), we scan the edges in in{v), stopping if we reach a new witness (x,v) with 
dg(x) +£(x,v) = dg(v). If that happens , we replace (u,v) with (x,v) in Tg, and 
stop. Else we conclude that v has also lost its last incoming edge in Hg. As in 
the Even-Shiloach algorithm, each incoming edge to each vertex v is considered 
at most once for each value of dg{v). Hence, we preserve the running time of 
0{Am). However, since Tg is a tree, we have reduced the internal space from 
0{m) to 0(n). Also we save work when edges from Hg \ Tg are deleted. 



3 King’s Algorithms 

As the basis for all the fully-dynamic algorithms of King in [9], we want to 
maintain all distances up to some threshold A. Besides edge deletions, we have 
multi-inserts that for each vertex v can add any set of edges incident to v. The 
amortized cost per multi-insert is 0(Z\n^). This also pays for all deletions. The 
space is 0{n^), but we reduce it to 0{An‘^). 

The algorithm uses the Even-Shiloach algorithm to maintains 2n deletions- 
only data structures for distances up to A. For each vertex v, we have one Out(y) 
for distances from v, i.e. with v as source, and one In{v) with distances to v, 
i.e. with u as a sink. For each pair (u,w) of vertices, disty{u,w) is the distance 
from u to u in In{v) plus the distance from u to ru in Out{v). 

We start with a graph with no edges. Whenever we make a multi-insert of 
edges incident to v, we reinitialize the pair {In{v),Out{v)) at amortized cost 
0{Am). When an edge is deleted, it is deleted from all pairs (In{v),Out{v)) 
created after it was inserted. 

Lemma 1. If the distance from u tow is utmost A, dist{u,w)= miiiy disty{u,w) . 

Proof. Obviously, dist(u,w) < disty{u,w) for all v. Consider an arbitrary short- 
est path from u to w and let v be the last vertex updated on this path. Then 
all the edges of this path are contained in (In{v),Out{v)) so disty{u,w) = 
dist{u, w). 
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Thus, our problem is to keep track of the vertex v minimizing distyiu, w) for each 
pair (u,w) of vertices. King’s idea is, for each possible distance d G Z\}, 

to maintain the list L{u,w,d) of vertices v with disty{u,w) = d. 

The King algorithm: Whenever a vertex u increases its distance to v in In{v), 
this increases all distances disty{u,w) < A. li d and d' are the old and the new 
distance of v, v has to be moved from L{u, w, d) to L{u, w, d'). This move takes 
constant time. Each vertex u increases its distance at most A times in In{v), 
and each time, we have to perform a move between lists for each vertex w with 
disty{u, w) < A. Thus, u pays a total cost of 0{An) for increases to its distance 
in In{v). The cost for increases to its distance in Out(v) is the same. Thus, for 
each pair (In(v),Out(v)), the total cost of distance increases is 0{An‘^), which 
is attributed to the multi-insert creating {In{v),Out{v)). 

To get the distance matrix, for each (u,w), we maintain the smallest d for 
L{u,w,d) yf 0. Then d = dist{u,w). For any vertex v in L{u,w,d), the pair 
{In{v),Out{v)) provides a shortest path from u to w. When doing a multi- 
insert, the above is trivially done by scanning all lists in 0{n^A) time. When 
doing deletions, our amortization argument allows the multi-inserts to pay for 
all distance increases, even if these are considered to be done one at the time. If 
L{u, w, d) becomes empty because some vertex v is moved to L(u, w, d'), we can 
afford to scan all the lists L{u, w,e), e = d + 1, ..., d' by charging the cost to the 
multi-inserts. 

Each pair {In{v),Out{v)) takes 0(m) space. For each (u,v,w), v is in 
L{u, w, disty{u, w)), so the total number of elements in the lists is O(n^). More- 
over, there are O(An^) = O(n^) lists. Hence, the total size of the data structure 
is O(n^). 

3.1 Improving the Space 

To reduce the space of the above data structure to 0{An“^), we need to address 
both the pairs (In{v),Out{v)) and the witness lists L{u,w,d). 

{In{v),Out{v)) For each pair {In{v),Out{v)), we wish to reduce the space 
to 0(n) using the trick from §2.1. However, that trick assumed for each vertex 
w, that we could traverse in{w) in some fixed order. That is, if {x, w) was the 
witness of w in T„, and if (x,w) was deleted, we could continue the traversal 
for witnesses from the successor of (x, w) in in{w) knowing that the same edge 
would never be considered twice for the same distance dy{w). 

To deal with the differing incidence lists, we spend 0(Z\n^), rather than 
0{Am), to maintain each pair (In{v) , Out{v)) . This does not affect our asymp- 
totic cost, since we have already incurrred this cost for each multi-insert. Hence 
we can spend 0(n) time to traverse in{w) in Out{v). 

We fix one arbitrary global ordering ^ of all the vertices. For each vertex w 
its list in{w) in Out{v) will be ordered as ^ orders its predecessors. However, 
in{w) will be stored only implicitly. We keep a global n x n incidence matrix 
which for each (x,y) G which stores its length £{x,y) or oo if (x,y) is not an 
edge. 
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We traverse in{w) in Out{v) by going through all the vertices x G V in the 
order ^ and keeping a pointer to our current location in the order. For each x, we 
check if (x, w) is an appropriate witness, i.e., if x)+£(x, w) = disty{v, w). 

For each edge (x, w) which serves as a witness, we keep a pointer back to v, so 
that when (x,w) or disty{v, x) is changed, we know to look for another witness. 

We only increment disty{v, w) after the order is traversed. We only decrement 
disty{v,w) when {In{v),Out{v)) is reinitialized. At botht times, we reset the 
pointer to the beginning of the order. Note that in this space-saving version, we 
may inadvertently add a witness which was created after the last time Out(y) 
was initialized. 

We maintain a shortest path tree of edge witnesses as described in Section 
2.1, so that nodes affected by a distance increase can be easily determined. 

As claimed, the total time to traverse the order is 0{n), giving the amortized 
cost of O(An^) for each pair {In{v),Out{v)). Our total space for all the pairs 
{In{v),Out{v)), including the global incidence matrix, is now 0{v?). 

L{u, w, d) Instead of the lists L{u, w, d) of vertices v with disty{u, w) = d, we 
maintain only the oldest witness v with disty{u,w) < d. This reduces the space 
to a constant per list, hence to to 0(Z\n^), as desired. 

The switch from ‘=’ to ‘<’ is crucial to our amortization. The point is that if 
V is not a witness for disty{u, w) < d, it will not become so before {In{v) , Out{v)) 
is replaced by a multi-insert. 

We maintain a historical list of the vertices ordered according to dates. Note 
that this order has nothing to do with the order ^ used previously. When we do 
a multi-insert of edges around v, we move v to the end of the historical list. 

Suppose d = disty{u,w) is increased. If v was the oldest witness of (u,w,d), 
we scan the historical list starting from the successor of v, stopping as soon as we 
find a younger vertex v' with disty>{u, w) = d. Then v' is the new oldest witness 
of (u,w,d), and otherwise, there is no witness. If disty{u,w) was increased by 
more than one, we repeat the above scanning if v was also the oldest witness 
for (m, w,d+ 1). As soon as we reach some value e so that v was not the oldest 
witness for {u, w, e), we stop; ii v' ^ v is the oldest witness for {u, w, e), it is also 
an older witness for all {u,w, f) with / > e. 

Since we always scan younger and younger witnesses, we see that a vertex v, 
between multi-inserts around v, can only be scanned once for each triple (u, w, d). 
So, the amortized cost of each multi-insert is 0{An?). By getting rid of most 
pointers, we also reduce the constants hidden in the O-notation. 

Below, we briefly describe how the above techniques are applied by King [9] 
to transitive closure, and leave the discussion of approximate and exact shortest 
paths to the full paper. We show that our space improvements remain valid for 
these applications. 

3.2 Transitive Closure 

King maintains the transitive closure of a digraph, in a manner similar to the 
classical method of repeated squaring of matrices. For i = 0, 1, ..., h = [logn]. 
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we maintain a digraph G* that is a subgraph of the transitive closure, and which 
is guaranteed to have an edge (y, w) if there is a dipath from u to tu in G of 
length at most 2*, i.e., G° = G, and G^ is the transitive closure. 

For t > 0, and for each vertex v € V, we maintain the pair 
Ot(t*+^(u')), with Z\ = 2. For i > 0, the edges of G*+^ are defined to be all pairs 
(u,w) such that there is a path from m to rc in (/n*(u), OuV'{v)) of length 3 or 
less. 

Lemma 2 (9). If a dipath P from u to w has length at most 2*+^ and 
the youngest vertex in the dipath is v, there is a path from u to w in 
of length 3 or less. 

Proof. Let v' be a middle vertex in P. Assume v' is in OuV‘'^^{v). Then P[u,v], 
P[v, u'], P[v' , w] are all of length at most 2*. Hence (u, v), {v, v') and {v' , w) were 
all in G* when Out*+^(u)) was last reinitialized. 

To save space, we implement the {In^^^{v'),Ouf~^^{v')) pairs as in the pre- 
vious section, with incidence matrices for i = l,...,h. When a multi-insert of 
edges around v occurs, and G° has been updated, then for z = 1, ..., ft,, we reini- 
tialize only (In'^~^^{v),Out^^^{v)) using all edges currently in GL A the multi- 
insert may create edges in G* not incident to v which do not affect older pairs 
(/n*+^(z;'), («')). 

If an edge is deleted from G, it is deleted from the incidence matrix for G°. 
For i = 1, ..., ft, as edges in G*“^ are deleted, this may result in the destruction 
of paths in {In'^~^^{v'),Out^~^^{v')) which in turn may result in the deletion of 
edges in G*+^. 

Given any edge {u, w) £ G*+^, to expand it to a path, we consider the witness 
V for distl'^^(u,w) < 3. Then (/n*+^(u), /n*+^(z;)) provide us a path of length 
< 3 from (u,w) using edges from G*, and each of these edges can be expanded 
recursively. 

To get simple paths, we combine with a linear time computation of strongly 
connected components [12]. For each strongly connected component, using an in- 
tree and an out-tree from an arbitrary vertex, we provide a simple path between 
any pair of vertices in the same strongly connected component. In linear time, 
we also label each vertex with the strongly connected component it is in. Now 
we only need to reconstruct segments between strongly connected components. 
So, if G*+^ provides us with a path of edges from G*, we only expand edge (x, y) 
if X and y are in different connected components. 

Since there are log n levels, we maintain the transitive closure in 0(n^ log n) 
amortized time per multi-insert and 0(n^ log n) space. 

4 Demetrescu-Italiano Transitive Closure 

The Demetrescu-Italiano transitive closure algorithm represents the graph as 
an adjacency matrix and achieves a log n speed-up by basing its algorithm on 
the static recursive scheme [8,10] for computing transitive closure, rather than 
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repeated squaring. The adjacency matrix of the graph is subdivided into four sub- 
matrices. The transitive closure can be shown to be a polynomial over these four 
sub-matrices. [4,3] show that the problem of updating a polynomial over Boolean 
matrices can be reduced to updating the product of degree 2 polynomials, with 
some errors allowed, as described below. Solving this problem in 0{n?) space 
gives an 0{n?) space bound for the whole algorithm. 

Let Ml and M 2 be two n x n Boolean matrices. In computing their product, 
there are two kinds of updates allowed to either matrix in which O’s are changed 
to I’s: one where any entry can be set to 1 and one where just entries in a 
particular row i and column i can be set to 1. The first is done by LazySet, the 
second by SetRow(z, M) and SetCol(z, M), respectively. In addition, I’s can be 
changed to O’s by ReSet. The “error” allowed is that in certain circumstances, 
the bits set by LazySet can be ignored. 

When entries (x,y) in Mi and {y,z) in M 2 are both 1, we call these entries 
a witness pair for {x, z), and y a witness for (x, z). A witness pair (x, z) cannot 
be ignored if its two entries {x, y) and (y, z) exist currently and they existed 
since a time when one of them was contained in a row or column updated by 
a SetRow or SetCol. That is, the problem is to maintain for each entry in the 
matrix product a witness pair (x, y, z) if one exists and was created at or before 
the most recently executed SetRow{x, Mi), SetCol{y, Mi), SetRow{y, M 2 ), or 
SetCol{z, M 2 ). 

The goal is to spend no more than 0{n^) time per SetCol and SetRow. 
Essentially, LazySet does nothing except change bits in Mi and M 2 ; SetCol and 
SetRow do the work. The cost of ReSet is amortized against the operations which 
change the O’s to I’s. 

The implementation of this in 0{n^) space is straightforward, and involves 
the keeping of lists of relevant witnesses. 

Our result: We can accomplish the multiplication of two matrices (with the 
allowable errors)more easily while maintaining witnesses. We do this with no 
change to the asymptotic time, maintaining a cost of 0{n^) time per SetRow 
and SetCol, and space O(n^). We use the technique of the previous sections, of 
keeping a pointer to the next available witness. We keep: 

- For each x, z, a pointer Pl{x, z) which is initially set to the first column of y 
in Ml such that Mi{x, y) AND M 2 {y, z)] these two entries are a witness pair for 
{x,z) 

- For each x, z, a pointer P2(x, z) which is initially set to the first row y in M 2 
such that Mi{x,y) AND M 2 {y,z)] these are a witness pair for (x,z). 

~ A queue of y ’s ordered by order of last execution of either SetCol(j/, Mi) or 
SetRow(y, M 2 ). 

- For each (x,z), a pointer P2>{x,z) which is initially set to the first y in the 
queue such that Mi{x,y) AND M2{y,z) 

~ Back pointers from the entries which are witnesses back to the {x, z) for which 
they are a witness. 

To do updates: 

When SetRow(x) in Mi is executed, we reset Pl{x, z) to the beginning for all 2 :. 
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When SetCol(z) in M 2 is executed, we reset P2{x,z) to the beginning for all x. 
When Mi{x, y) or M 2 {y, 2 :)is set to 0 during a ReSet: 

- If Pl(x,z) points to y, then we move Pl{x,z) forward until another witness 
is found or the end of the row. 

- If P2(x, z) points to y then we move P2{x, z) forward until another witness is 
found or the end of the column. 

- If P3{x, z) points to y, then we advance P3{x, z) until another witness is found. 
Running time analysis: Each time a pointer is set back to the beginning of a 
row or column, there may be another n checks at a cost of 0(1) each. P2>{x, z) 
is never set back but there may be an added cost of 0(1) for every pair (x,z) 
for each queue change. We charge the SetCol or SetRow to cover this. 
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Abstract. In an undirected, 2-node connected graph G = {V, E) with 
positive real edge lengths, the distance between any two nodes r and s 
is the length of a shortest path between r and s in G. The removal of a 
node and its incident edges from G may increase the distance from r to 
s. A most vital node of a given shortest path from r to s is a node (other 
than r and s) whose removal from G results in the largest increase of the 
distance from r to s. In the past, the problem of finding a most vital node 
of a given shortest path has been studied because of its implications in 
network management, where it is important to know in advance which 
component failure will affect network efficiency the most. In this paper, 
we show that this problem can be solved in 0{m + nlogn) time and 
0(m) space, where m and n denote the number of edges and the number 
of nodes in G. 



1 Introduction 

The computational infrastructure throughout society is becoming increasingly 
large and complex. Networks of workstations are vulnerable to attack and failure, 
and it is generally recognized that the survivability of our computing systems is 
a critical issue (IEEE Computer dedicated its August 2000 issue to this topic). 
We are interested in a particular type of survivability: How is a communication 
network affected by the failure of a component? In this paper, we consider the 
effect that a node failure will have on a shortest path between two nodes. Our 
scenario assumes that each message is routed along a shortest path in a com- 
munication graph from its source to its destination. When a node on that path 
fails, we need to replace the old route by a new one, preferably by a shortest 
path in the graph that does not contain the failed node. Let us call this new 
route a replacement (shortest) path; it will in general be longer than the path 
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it replaces, but it certainly will never be shorter. From a network management 
point of view, it is desirable to know for a shortest path ahead of time which 
node failure will result in the longest replacement path. Such a node is called a 
most vital node, because its failure degrades the transmission from the source to 
the destination most strongly. 

The problem of finding a most vital node of a shortest path has been defined 
and motivated by Corley and Sha [3]. More precisely, they considered the more 
general problem of finding the k most vital nodes of a shortest path, that is the 
k nodes whose removal will increase the distance between the source and the 
destination node the most, and they gave an exponential algorithm for solving 
the problem. For the case fc = 1, an efficient implementation of their algorithm 
requires 0{mn + log n) time, where m and n denote the number of edges and 
the number of nodes in the underlying graph. Note that this is not better than the 
trivial bound that we get by recomputing from scratch the replacement shortest 
path for every node along the given shortest path. Later on, Bar-Noy et al. [2] 
proved that, for arbitrary k, the problem is strongly NP-hard. Finally, Venema 
et al. [13] studied the problem for fc = 1 in a parallel computing environment, 
providing a polynomial algorithm. 

In a related scenario, nodes are reliable, but edges can fail. The problem of 
finding a most vital edge on a shortest path has been studied extensively in the 
past: We look for an edge whose failure leads to the longest replacement path 
[1,2,3]. Now, naturally a replacement path is just a path avoiding the failed edge. 
Let us assume that the source and destination nodes lie in a 2-edge connected 
component of the given graph; otherwise, a most vital edge is just a bridge 
between these nodes, and that is easy to find. The fastest algorithm to compute 
most vital edges on a pointer machine runs in 0(m+n log n) time and 0(m) space 
[6]. On a RAM, there is an algorithm to solve the problem (and an interesting 
variant of it defined in [7]) in 0{m ■ a{m,n)) time and 0(m) space [8], where 
a{m, n) denotes the functional inverse of the Ackermann function defined in [12]. 

In this paper, we show that the problem of finding a most vital node for a 
shortest path in a 2-node connected, undirected and positively weighted graph 
can be solved on a pointer machine in 0(m -I- n log n) time and 0(m) space. The 
efficiency of our algorithm is based on two considerations. First, we make use 
of specific structural properties of replacement paths in the computation. This 
is realized by means of a priority queue that stores certain distance values for 
certain nodes. Unfortunately, the priority queue contains distance values that 
would lead to an incorrect result if they ever would be used, and it also contains 
the desired distance values leading to the correct result. The reason for this 
mix is algorithmic performance: We have no way of efficiently distinguishing 
between both, but we make sure that the algorithm never uses the undesired 
values. Second, we perform this computation incrementally as we visit the nodes 
along the shortest path. 

The paper is organized as follows: In Section 2 we define the problem formally 
and give the required basic definitions. In Section 3, we present the structural 
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properties of replacement paths, along with our algorithm. Finally, Section 4 
discusses modifications and further applications, and lists some open problems. 

2 Basic Definitions 

Let G = (V, E) be an undirected graph, where V is the set of nodes and E C 
X y is the set of edges. Let n and m denote the number of nodes and the 
number of edges, respectively, and, for each e € E, let w(e) be a positive real 
length. A graph ff = (V(H), E(H)) is called a subgraph of G if V{E[) C V and 
E{H) C E. If V{H) = V then H is called a spanning subgraph of G. 

A simple path (or a path for short) in G is a subgraph P with V (P) = 
{vo,vi, . . . ,Vk\vi yf Vj for i ^ j} and E{P) = {(uj,Wi+i)|0 < i < k}, also 
denoted as P{vQ,Vk) = {vq, . . . ,Vk)- Path P{vo,Vk) is said to go from vq to Vk 
or, alternatively, to be between vo and Vk- Its length is the sum of the lengths of 
the path edges, and will be denoted as |P(uo: Vfc) |. A graph G is connected if, for 
any two nodes u,v € V, there exists a path in G going from utov.A connected 
acyclic spanning subgraph of G is called a spanning tree of G. Let G — v denote 
the graph obtained by removing from G the node v and its incident edges. A 
graph G is 2-node connected if for any v & V , G — v is connected. 

A path between two nodes r and s is shortest in G if it has minimum length 
among all the paths in G between r and s. In this case, we denote the path by 
PG{r,s), and its length, also known as the distance in G between r and s, by 
dG{r,s). For a distinguished node r € V, called the source, and all the nodes 
u G y \ {r}, a single-source shortest paths tree (SPT) Scir) in G is a spanning 
tree of G rooted in r and formed by the union of shortest paths, with one shortest 
path from r to u for each v G y \ {r}. 

Let Pg-v{‘>', s) be a shortest path between r and s in G — u, named a replace- 
ment shortest path for v, and let dc-vir, s) denote its length. The most vital node 
(MVN) problem with respect to Pair, s) asks for finding a node v* £ V \ {r, s} 
such that dc-v* {r, s) > dc-v(r, s), for any v G y \ {r, s}. 

3 An Efficient Solution of the MVN Problem 

Let Pg{t, s) = (uo, ui, . . . , Vk) be a shortest path between r = vq and s = Ufc in G. 
First of all, notice that a node (other than r and s) whose removal increases the 
distance between r and s must belong to the node set {ui, . . . , Vk-i}- Therefore, 
in the following, we will consider only the removal of the nodes along the path. 

3.1 The Structural Properties of Replacement Shortest Paths 

In this section, we present the structural properties of replacement shortest paths 
that will form the basis for the efficiency of our algorithm. 

Let SG{r) denote a SPT in G rooted at r and containing PG{r,s), and let 
Vi,l < i < k — 1, he a, node on PG{r, s). When node Vi and its incident edges 
are removed from G, SG{r) is partitioned into a set of subtrees, that we classify 
as follows (see Figure 1): 
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Fig. 1. Node Vi is removed from G: Sa{r) is partitioned into a set of subtrees, with 
node sets Ui, Oi and Vi. 



1. the subtree of Sa{r) containing the parent Uj_i of vf, we call the nodes of 
this subtree the upwards nodes of Vi, and we denote them as Up, 

2. the subtree of Sc{r) containing the child Vi+i of vp we call the nodes of this 
subtree the downwards nodes of Vi, and we denote them as Vi, 

3. all the remaining subtrees of Sa{r); we call the nodes of the union of all 
these subtrees the outwards nodes of Vi, and we denote them as Oi. 

In the rest of the paper, we will make use of the following properties, that 
hold for i, j = 1, . . . ,k — 1 and j ^ i: 

(PI:) U^GO^\JV, = V\{v,y, 

(P2:) Ui,Oi and Vi are pairwise disjoint; 

(P3:) U,GU,+p 
(P4:) Vi+i C Vp 
(P5:) O, n Oj = 0. 

We start by observing that for nodes in Ui, the shortest path to r does not 
contain Vi. This immediately implies the following: 

Lemma 1. For each node u € Ui, da{r,u) = dG-vi{r,u). □ 

On the other hand, for nodes in Vi, we have that the distance to s does not 
change when Vi is removed: 

Lemma 2. For each node u € Vi, dG(s,u) = dG-vi(s,u). 

Proof. Suppose, for the sake of contradiction, that dG{s,u) yf dG-vi(s,u). Let 
Sg{s) be a SPT in G rooted in s containing Pg^u, s). From dG{s, u) yf dG-Vi(s, u). 
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it follows that every shortest path in G between s and u, in particular Pg{s,u) 
in S'g(s), contains Vi. Therefore, Pg{s,u) has to contain its parent Vi+\ in Sg{s). 
Hence, the edge e* = {vi, fi+i) belongs to Pa{s, u), and since subpaths of shortest 
paths are shortest paths, we have that 

d,G{vi+i,u) = w(ei) + dG(vi,u) > dG(vi,u). 

On the other hand, by the fact that u € T>i, we have that PG(r,u) in 5 'g( 7’) 
contains Vi and fi+i- Hence, since subpaths of shortest paths are shortest paths, 
we have that 



dG(vi,u) = w{ei) + dG{vt+i,u) > dG{vi+i,u), 

that is, we have a contradiction. □ 

Let E{Ui U Oi,T>i) C E he the cut induced by Vi in G — Vi, i.e., the set of 
edges of G — Vi with one end node in I4i U Oi and the other one in Vi. In the 
following, an edge / = (x,y) G E{Ui UOi,Vi) will be considered as having node 
y in Vi. Edges in E{UiLlOi, Vi) will be named the crossing edges associated with 
Vi. Since any replacement shortest path for node Vi must use a crossing edge, 
the length of the path can be expressed as follows: 

dG-vi{r,s)= min {dG-vi{r,x) + w{f) + dG-vAy, s)}. (1) 

f={x,y)eE(UiUOi,T>i) 

This immediately suggests an algorithm to solve the MVN problem, but 
unfortunately we do not know how to compute all the dG-vi{r,x) distances 
sufficiently fast, since this might require the computation, for each node Vi on 
Pg{x, s), of the SPT rooted in r in G — Vi. Therefore, let us look at replacement 
shortest paths more closely. In a path in G — Vi from r to s, let us call the path 
node y & Vi closest to r the entry node (into Vi) of the path. We can prove that, 
to minimize (1), not all the distances in G — between r and nodes in Ui U Oi 
need to be computed: 

Lemma 3. Any replacement shortest path for node Vi can he expressed as a 
concatenation of PG-vi-Vi(x,x), edge {x,y) and PG{y,s), where y is the entry 
node into Vi, PG-vi-T>i{x, x) is a shortest path from r to x in G — Vi — Vi, and 
PciUj s) is a shortest path from y to s in G. 

Proof. Since r is not contained in Vi, but s is, there is a first node on the 
path traced from r towards s that belongs to Vi. Call that node y, and call its 
predecessor on the path x. This proves the first and second part of the claim. 
Part three is due to Lemma 2. □ 

3.2 Computing Components of Distances 

Lemma 3 allows us to compute replacement shortest paths as follows. First, we 
compute in G a SPT rooted in s. This gives us all distances dG-Vi{y,s) for 
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y € T>i,i = — 1. Second, we compute all paths Pc-Vi-Vii'i',x) from r 

to X G l/i U Oi,i = 1, . . . , k — 1. This is described in more detail in the follow- 
ing paragraph. Third, we compose these distance components by inspecting all 
crossing edges as we go along the nodes Vi,i = 1, . . . , k — 1. This is described in 
more detail in Section 3.3. 

For X GlAi, from Lemma 1 we have dc-Vi-T>i{x, x) = dcixjx), and therefore 
the SPT rooted in r that contains Pair, s), denoted as Sa{r), gives us all these 
values. The remaining more interesting task is the computation of dc-vi-Vi (x, x) 
for X G Oi- We propose to do this as follows, making use of Sa{r). When node 
1, is removed, we consider the subtree of Sc{r) induced by Ui - 
which is of course a SPT rooted in r of the subgraph of G induced by the node set 
Ui- Then we compute the distance from r to all the nodes in Oi in the subgraph 
of G induced by UiVdOi. We do this by applying Dijkstra’s algorithm [4] in the 
following way to the nodes in Oi, starting from the precomputed distances for 
Ui- Let E{Ui,Oi) be the subset of edges in E having one end node in Ui and 
the other one in Oi, let E{Ui, x) be the subset of edges in E(Ui, Oi) having one 
end node in Ui and the other one ixi x G Oi, and let E{Oi,Oi) be the subset of 
edges in E having both end nodes in Oi. We create an initially empty heap Hi, 
inserting into it all the nodes x G Oi, with key 



f min {dG-Vi{r,u) + w{f)} if E{Ui,x) ^ il); 
k{x) = < f=G,x)^E(Ui,x) 

1 -|-oo otherwise. 



(2) 



Afterwards, we extract the minimum k{x) from Hi] this gives us dc-vi-Vt (x,x). 
Then, we update the keys of adjacent nodes still appearing in Hi, by making use 
of edges in E{Oi,Oi), exactly as in Dijkstra’s algorithm. The algorithm iterates 
until Hi is empty. 

This algorithm has an efficient implementation, as expressed in the following 
lemma: 



Lemma 4. The values dc-vi-Vi (j, x) for all nodes x G Oi,i = 1, . . . ,k—l, can 
he computed in 0{m + nlogn) time and 0{m) space. 

Proof. The initial computation of Scir) takes 0{m + nlogn) time and 0{m) 
space [5]. Throughout the k — 1 iterations in our algorithm, k — 1 = 0(n) heaps 
are created. Let Ui denote the number of nodes of Oi, and let mi = \E{Ui,Oi)U 
E{Oi, Oi)\. On heap Hi, we perform 0{ni) Insert operations, and from Lemma 1, 
key initialization can be performed in 0{mi) time once Sg{t) has been computed. 
Moreover, we perform 0{ni) ExtractMin and 0{mi) DecreaseKey operations. By 
using Fibonacci heaps [5], for all the nodes x G Oi, dc-y.-x).(r, x) can then be 
computed in 0(mi -I- n^ log n^) time. 

Since each DecreaseKey operation is associated with an edge of G, and each 
edge of G is considered at most twice, and given that sets Oi are disjoint, we 
finally have that the total time is 

fc-i 

0{mi + Ui log Ui) = 0{m + nlog n). 




284 Enrico Nardelli, Guido Proietti, and Peter Widmayer 



3.3 Combining Components of Distances 

We are now ready to combine the distance components computed so far. We first 
give a description of the algorithm, and we then analyze its correctness and its 
time and space complexity. 

We consider the nodes ui, . . . , Vk-i in this order along Pair, s), and when the 
node Vi is considered, we maintain in a heap 'H the set of nodes T>i associated 
with it. For each node y £T>i, we consider the subset of edges in E{Ui UOi,T>i) 
incident to y, denoted as E{Ui U Oi,y). In the heap, with node y a key k{y) is 
associated that satisfies the following condition immediately before a FindMin 
operation on "H is performed: 

AdG-vi--Di{r,x) + w{f)+dG{y,s)}. (3) 

f={x,y)eE{UiUOi,y) 

Notice that in general, this key value is not the length of a shortest path in 
G—Vi from r to s through y, but, as we explained in Section 3.1, we cannot afford 
to maintain these latter values. We will show later that these keys, however, give 
us sufficient information to solve the problem. 

The algorithm works in stages. At the beginning, the heap "H is created for 
Vq, that is, all the nodes in the subtree T>o rooted at Vi in So{r) are inserted, 
with arbitrarily large keys associated. At the zth stage, we consider the node vt 
on Pair, s), and we update the heap in the following way: 

Step 1 : We remove from T~L the node Vi and the nodes Oi associated with it. 
(Comment: This leaves exactly the nodes in Pi in 'H; we update their keys 
in Steps 2 and 3.) 

Step 2: We consider all the nodes in Oi; for each such node x, we inspect its 
incident edges, and we limit further actions on those crossing into Pi. Let 
/ = {x, y) be one of these crossing edges, if any, and let 

k' = da-vi-Vi {r, x) + w{f) + dc{y, s), (4) 

where dG-vi-Vi{r,x) has been computed by means of the procedure de- 
scribed in Section 3.2. If k' < k{y), we decrease the key of y in "H to value 
k' . (Comment: When this step is completed, all the crossing edges associated 
with Vi and induced by its removal have been exhausted.) 

Step 3: We then consider all the nodes in Vi-i U Oi-\; for each node x in 
this set, we look at its incident edges, and we limit further actions on those 
crossing into Pi. Let / = (x, y) be one of these crossing edges, if any, and let 

k' = dG{r,x) + w{f)+dG{y,s). (5) 

If k' < k{y), we decrease the key of y in "H to value k'. (Comment: When 
this step is completed, all the crossing edges associated with Vi and induced 
by the reinsertion of Vi-i have been exhausted, and the corresponding key 
maintenance in the heap is complete.) 
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Step 4: We finally find the minimum of %. (Comment: We will prove shortly 
that the key associated with this minimum is exactly the length of a replace- 
ment shortest path va G — Vi between r and s, that is do-vi{r, s).) 

When all stages 1, . . . , fc — 1 have been completed, a most vital node can then 
be determined as a node Vj on Pair, s) such that 

d,G-vAr,s)= max s)}. (6) 

Let us now prove that our algorithm indeed computes at each stage the length 
of a corresponding replacement shortest path. 

Lemma 5. The minimum key found in TL at the ith stage is the length of a 
replacement shortest path between r and s in G — Vi- 

Proof. We prove the lemma in two steps. First, we prove that each key in the 
heap TL, say k{y) for node y, is the length of a shortest path in G — Vi from r to s 
through the entry node y. The reason is that our algorithm inspects all crossing 
edges {x, y) incident to y, and keeps track of the best. 

Second, we prove that at least one node in the heap has a key corresponding 
to the length of a replacement shortest path between r and s in G — Vi. In 
fact, for any replacement shortest path P in G — vi, the corresponding entry 
node y is in "H. Let x be its predecessor on P. Then, k{y) equals the length of 
P, because the prefix of P from r to a: is contained in G — Vi — T>i, and then 
dc-Vi-T’iir, x) = dG-Vi(r,x). Therefore, (1) and (3) are both minimized when 
edge (x,y) is considered, and k{y) = dG-vAf, s). □ 

The following theorem can finally be proved: 

Theorem 1. A most vital node on a shortest path PgA, s) between two nodes r 
and s in a 2-node connected, undirected graph G = (V, E) with n vertices and m 
edges, with positive real edge lengths, can be determined in 0{m nlogn) time 
and 0{m) space. 

Proof. The correctness of the above algorithm derives from Lemma 5. The time 
complexity follows from that of Lemma 4 for the initial phase. This allows us 
to compute (4) in 0(1) time for each crossing edge. Clearly, (5) can be com- 
puted in 0(1) time for each crossing edge as well, once SgA) and Sg{s) have 
been computed. Globally, we perform 0{m) computations of (4) and (5), since a 
crossing edge is checked at most twice, once each in Step 2 and Step 3. Then, 
we make use of a Fibonacci heap [5] for maintaining TL. Since each node of G is 
inserted into the heap and removed from it at most once, we have a single Make- 
Heap, 0(n) Insert, k — 1 = 0{n) FindMin, 0(n) Delete and 0{m) DecreaseKey 
operations (since a key may be decreased only when a new crossing edge is con- 
sidered), and thus we obtain a total time of 0{m -\- nlogn) for heap operations. 
The time complexity for other tasks is respectively 0(m-\- nlogn) time for com- 
puting SgA) and Sg{s), 0(n) time for managing sets Oi, i = 1, . . . , k — 1, and 
0(n) time for computing (6). Finally, 0(m) space is trivially enough to handle 
all the operations. Thus, the claim follows. □ 
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4 Discussion 

In this paper we have presented a fast solution to the problem of finding a most 
vital node along a shortest path Pair, s) between two nodes r and s in a graph 
G. Implicitly, our algorithm computes not only the lengths of replacement paths, 
but also the paths themselves; it can be easily modified to do so explicitly. It 
runs in 0(m + nlogn) time and 0(m) space, which, as far as we know, is the 
first improvement over the trivial bound of 0{nm + n^logn) time and 0{m) 
space that we get by recomputing a replacement shortest path between r and s 
from scratch after the removal of each node along Pair, s). 

In some applications, such as transportation networks, it appears to be more 
realistic to associate costs with both, nodes and edges, instead of only one type of 
network components. Our approach also answers the corresponding more general 
question that suggests itself: In a graph where both edges and nodes have a non- 
negative cost, and where both edges and nodes can fail, what is a most vital 
edge or node on a shortest path? The algorithm can be modified slightly and 
still runs within the same asymptotic bounds for this more general question, for 
two reasons. First, edge failures can be modelled as node failures, when each 
edge is replaced by a path of length two with an extra node in the center of that 
path; then, the failure of the extra node represents the failure of the original edge. 
Second, Dijkstra’s algorithm can be adapted easily to work also for shortest path 
computations in graphs with costs on edges and nodes, where the cost of a path 
is the sum of the costs of its edges and nodes. Obviously, both modifications do 
not change the asymptotic bounds for the runtime and the storage space. 

Our algorithmic solution is also useful in quite a different application con- 
text. In large networks, components (nodes and edges) may be owned by different 
owners. The incentive of an owner of a component to forward a message, nat- 
urally, is to get some reward. In standard economic terms, that reward is the 
price of the service of forwarding the message. It is economically desirable that 
each owner declares the true price for the service that its component offers, so as 
to allocate the overall resources in a best possible way. Nevertheless, there is an 
incentive for owners to speculate and ask for a higher price, in the hope of getting 
a higher profit. This leads to economically suboptimal resource allocation and 
is therefore undesirable. A few studies in the computer science literature have 
devoted their attention to setting the boundary conditions in such a way that 
speculating with high prices does not pay off. This is known as mechanism de- 
sign for selfish agents [9,10,11]. In [10], Nisan is explicitly suggesting a rewarding 
model for forwarding messages on paths, based on microeconomic theory, that 
requires the computation of replacement path lengths for edges. This model as- 
sumes that only edges charge a price for forwarding a message; nodes perform 
their service for free. Here, again, it would be more realistic to have a price for 
both, edges and nodes, than a limitation of the pricing to the edges alone. A 
straightforward modification of the charging scheme from [10] to node and edge 
prices serves the desired purpose. Now, the modification of our algorithm for 
node and edge costs and failures is an efficient implementation of the required 
replacement path computations. 
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Our solution is efficient, but it is still open whether it is optimal. Notice 
that to improve our solution, a faster computation of a single source shortest 
paths tree must be provided. For more general settings there are still many open 
problems; one of them deals with multiple edge or node failures on a shortest 
path. 
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Abstract. Let G be a set of colors, and let w be a cost function which 
assigns a real number cu(c) to each color c in C. An edge-coloring of a 
graph G is to color all the edges of G so that any two adjacent edges are 
colored with different colors. In this paper we give an efficient algorithm 
to find an optimal edge-coloring of a given tree T, that is, an edge- 
coloring / of T such that the sum of costs w(/(e)) of colors /(e) assigned 
to all edges e is minimum among all edge-colorings of T. The algorithm 
takes time 0{nA^) if n is the number of vertices and A is the maximum 
degree of T. 



1 Introduction 

A vertex- coloring of a graph G = (V, E) is to color all the vertices of G so 
that any two adjacent vertices are colored with different colors. We denote by 
n the number of vertices in G. Let G = {ci, C 2 , • • • ,Cm} be a set of colors, and 
assume that m = \G\ is sufficiently large, say m = n. Let w : C — >■ R be a 
cost function which assigns a real number uj{c) G R to each color c G G. The 
cost vertex-coloring problem is to find a vertex-coloring / : U — >■ C of G such 
that '^(/(^)) small as possible. One can observe that the cost vertex- 

coloring problem is NP-hard since the ordinary vertex-coloring problem is NP- 
hard. The cost vertex-coloring problem remains NP-hard for interval graphs, but 
the problem can be solved in linear time for trees [9] . The “vertex-chromatic sum 
problem” in [10,11] is merely the cost vertex-coloring problem for a special case 
where w(cj) = i for each index i>l. Jansen gives a polynomial-time algorithm 
to solve the vertex-chromatic sum problem for graphs of bounded treewidth [5] . 

In this paper we introduce the cost edge-coloring problem. An edge-coloring 
f : E ^ G of a graph G = (U, E) is to color all the edges of G so that any two 
adjacent edges are colored with different colors. The minimum number of colors 
required for edge-colorings of G is called the chromatic index, and is denoted 
by x'(G). An ordinary edge-coloring problem is to compute the chromatic index 
x'(G) of a given graph G and find an edge-coloring of G with yf{G) colors. On 
the other hand, the cost edge-coloring problem is to find an optimal edge-coloring 
of G, that is, an edge-coloring / such that Eess minimum among all 
edge-colorings of G. An optimal edge-coloring does not always use the minimum 
number x'(G) of colors. For example, suppose that w(ci) = 1 and oj{ci) = 5 
for each index i > 2, then the graph G with x'(G) = 3 in Figure 1(a) can be 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 288-297, 2001. 

© Springer- Verlag Berlin Heidelberg 2001 
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Fig. 1. (a) Edge-coloring with ~x'{G) colors and (b) optimal edge-coloring with 
x'(G) -I- 1 colors. 



uniquely colored with the three cheapest colors ci,C 2 and C 3 as in Figure 1(a), 
but the edge-coloring is not optimal; an optimal edge-coloring of G uses the four 
cheapest colors ci, 02,03 and 04 as depicted in Figure 1(b). The edge- chromatic 
sum problem introduced by Giaro and Kubale [4] is merely the cost edge-coloring 
problem for a special case where w(c^) = i for each index z > 1. The cost edge- 
coloring problem has a natural application in scheduling theory. Consider the 
scheduling of unit execution time of biprocessor tasks on dedicated machines. An 
example of such tasks is the file transfer problem in a computer network in which 
each file engages two corresponding nodes, sender and receiver, simultaneously 
[3] . Another example is the biprocessor diagnostic problem in which links execute 
concurrently the same test for a fault tolerant multiprocessor system [ 6 ]. These 
problems can be modeled by a graph G in which machines correspond to the 
vertices and tasks correspond to the edges. An edge-coloring of G corresponds 
to a schedule, where the edges colored with color Ci G C represent the collection 
of tasks that are executed in the zth time slot. For each i, if a task is executed in 
the zth time slot, then it takes the cost uj{ci). The goal is to find a schedule that 
minimizes the total cost. This corresponds to the cost edge-coloring problem. 

One can observe that the cost edge-coloring problem is NP-hard since the 
ordinary edge-coloring problem is NP-hard. Furthermore, the cost edge-coloring 
problem remains NP-hard for bipartite graph, because the edge-chromatic sum 
problem is NP-hard for bipartite graphs [4]. When restricted to trees, many 
NP-hard problems can be efficiently solved in polynomial time [1,2,13,14]. Giaro 
and Kubale recently give an algorithm to solve the edge-chromatic sum problem 
for trees T in time 0(nZ\^'® log n), where n is the number of vertices and A 
is the maximum degree of T [4]. However, it is not known whether the cost 
edge-coloring problem can be solved in polynomial time for trees. 

In this paper we give an algorithm to solve the cost edge-coloring prob- 
lem for trees T in time 0{nA'^). The algorithm hence runs in linear time if 
Z\ is a fixed constant. The algorithm takes time 0(nZ\^'^ log(Z\iV)) if all the 
color costs w(c) are integers in the range [— A^j. The edge-chromatic sum 
problem is a special case of the cost edge-coloring problem in which N = n. 
Our algorithm can therefore solve the edge-chromatic sum problem for trees 
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in time 0(nZ\^’® log n). The time-complexity 0{nA^-^ logn) is better than the 
time-complexity 0{nA^-^ log n) of Giaro and Kubale’s algorithm [4]. We also 
show that any tree T has an optimal edge-coloring using exactly x'(T) = A{T) 
colors. 

2 Preliminaries 

In this section we define some terms and present easy observations. Let T = 
(V, E) denote a tree with vertex set V and edge set E. We will use notions 
such as root, child, descendant and leaf in their usual meaning. We denote by 
n the number of vertices in T. T is a “free tree,” but we choose for the sake of 
convenience an arbitrary vertex as a root r, and regard T as & “rooted tree.” 
The degree deg{v) of vertex v is the number of edges incident to v. We denote 
the maximum degree of T by A(T) or simply by A. We denote by d{v) the 
number of children of v in T. Then d{v) = deg(y) if u = r, and d{v) = deg{v) — 1 
otherwise. An edge joining vertices u and v is denoted by (u,v). For a vertex v 
of T, the subtree of T induced by v and all descendants of v is called the subtree 
ofT rooted at v, and is denoted by T„. Clearly, T = Tr for the root r. 

Since tree T has n — 1 edges, one may assume without loss of generality that 
the color set C satisfies \C\ = n— 1. We however assume for the notational sake 
of convenience that \C\ = n, and write C = {ci,C 2 ,--- ,c„}. An edge-coloring 
/ : A — >• C of a tree T is to color all edges of T by colors in C so that any 
two adjacent edges are colored with different colors. Let w : C — >■ R, where R 
is the set of real numbers. One may assume with loss of generality that uj is 
non-decreasing, that is, w(ci) < w(ci+i) for any index i, 1 < i < n. The cost 
oj{f) of an edge-coloring f of a tree T = (V,E) is defined as follows: 

eeE 

An edge-coloring / of T is called an optimal one if w(/) is minimum among all 
edge-colorings of T. The cost edge-coloring problem is to find an optimal edge- 
coloring of a given tree T. The cost of an optimal edge-coloring of tree T is called 
the minimum cost of tree T, and is denoted by uj(T). 

Let / be an edge-coloring of T. For each vertex v of T, let C(f, v) be the set 
of all colors that are assigned to edges incident to v, that is, 

C{f,v) = {/(e) I e is an edge incident to v in T}. 

We say that a color c £ C is missing at v if c ^ C{f,v). Let Miss(/, u) be the 
set of all colors missing at v, that is, 

Miss{f,v) = C-C{f,v). 

Interchanging colors in an alternating path is one of the standard techniques 
of an ordinary edge-coloring, which we also use in the paper. Let / be an edge- 
coloring of a tree T = (V,E), let Ci and cj be any two colors in C, and let 
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T{a,Cj) be the subgraph of T induced by all the edges colored with a and Cj. 
Since T is a tree, each connected component of T{ci, Cj) is a path of length one 
or more, whose edges are colored alternately with Cj and Cj. We call such a path 
a CiCj- alternating path. A vertex u G F is an end of a CjCj-alternating path if 
and only if exactly one of Ci and Cj is missing at v. If v is an end of a CiCj- 
alternating path, then the path is denoted by P{v,Ci,Cj). The set of all edges 
in P{v;Ci,Cj) is sometimes simply denoted by P{v;Ci,Cj). Interchanging colors 
Ci and Cj in P(v;Ci,Cj), one can obtain another edge-coloring /' of T. We then 
have the following lemma on /'. 

Lemma 1. If a G Miss(/, u), Cj G C{f,v), and i < j, then 
Ci G C{f',v), Cj G Miss(/',u), and C{f',v) U {cj} = C{f,v) U {a}. 

Proof. The colors of all edges assigned by / are the same as those by /' except 
for the edges in P{v; Ci, Cj). The first edge in P{v; Ci, Cj) is colored with Cj by /, 
but it is colored with a by /'. We therefore have Cj G C{f',v), Cj G Miss{f',v), 
and C{f,v) U {cj} = C{f,v) U {cj}. Since w(cj) < uj(cj) and P{v, Ci,Cj) starts 
with an edge colored with Cj, we have 

XI ‘^(/'(e)) < X ^(/(e))- 

e€P{v;Ci,Cj) e€P{v;Ci,Cj) 

We thus have w(/') < w(/). Q.S.V. 



3 Algorithm 

In this section we prove the following theorem. 

Theorem 1. An optimal edge-coloring of a tree T can be found in time 0{nA'^) 
if T has n vertices and A is the maximum degree of T. 

In the remainder of this setion we prove Theorem I. In Section 3.1 we give 
an algorithm to compute the minimum cost w(T) of a given tree T, and in 
Section 3.2 we give an algorithm to find an optimal edge-coloring of T. 

3.1 Computing the Minimum Cost oj{T) 

A “dynamic programming method” is a standard one to solve a combinatorial 
problem on trees. We also use it, and compute the minimum cost ui{T) of a given 
tree T by the “bottom-up tree computation.” However, one can observe that the 
minimum cost of a subtree T„ rooted at a vertex v cannot be computed 

directly from the minimum costs uj(Tyj), 1 < j < d(v), of all the children Vj of 
V. (See Figure 2.) 

Our first idea is to introduce a new parameter w(T„, i) defined for each vertex 
V of T and each color G C as follows: 

oj{Ty,i) = min{o;(/) | / is an edge-coloring of T„ and Ci G Miss(/, v)}. 
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Fig. 2. Subtree T„ rooted at v. 



Since T has exactly n—1 edges and \C\ = n, Ty has an edge-coloring / in which Ci 
is not used by / and hence Ci G Miss(/, v). Thus uj{Ty,i) is well-defined. Clearly, 

w(T„) = min uj(Ty,i). 

l<i<n 

We compute the values w(T„, i) for all indices i,l <i <n, from leaves to root r. 
Thus the DP table for each vertex v consists of the n values uj{Ty,i), I < i < n, 
and has a size n. 

The second idea is to reduce the size from n to d{v) + 1. We first have the 
following two lemmas on the properties of uj(Ty,i). 

Lemma 2. For any vertex v of tree T, to(Ty,i) is non-increasing with i, that is, 
uj{Ty,i) > uj{Ty,i 1) for each index i, 1 < i < n — 1. 

Proof. Let / be an edge-coloring of T„ such that Ci £ Miss(f,v) and w(/) = 
uj{Ty,i). If Ci +1 G Miss(/, w), then the definition of Lo{Ty,i 1) implies that 
ui{Ty,i) = oj{f) > ui{Ty,i 1). One may thus assume that c^+i G C{f,v). 
Then, interchanging colors Ci and c^+i in P(u; c,, Cj+i), we obtain another edge- 
coloring /' of T. By Lemma 1 oj{f) > oj{f') and c^+i G Miss(/',v). Therefore 
by the definition we have 

u){Ty, i) = Lo{f) > Uj{f) > u}{Ty, i 1). 



Q.£.V. 



Lemma 3. For any vertex v of tree T, 

uj{Ty,d{v) -I- 1) = uj{Ty, d{v) -I- 2) = • • • = w(T„, n). 

Proof. Let i be any index such that d(y) 2 < i < n. Then by Lemma 2 
uj{Ty,d{v) -I- 1) > uj{Ty,i). It therefore suffices to prove that w(T„,d(u) -I- 1) < 

uj{Ty,i). 
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Let / be any edge-coloring of such that Ci £ Miss(f,v) and w(/) = 
oj(Ty,i). Since |C'(/, v) | = d(v), at least one of the d(w)-|-l colors ci, C 2 , • • • , c,i(^v)+i 
is missing at v. Let cj be any of them, then 1 < j < d(v) + 1 and Cj G Miss(/, v). 
By the definition of oj{Ty,j) we have 

w(T„,j) < w(/) = w(T„,i). (1) 

Since j < d{v) -I- 1, by Lemma 2 

u){Ty,d{v) + l) <ui{Ty,j). (2) 

Thus by Eqs. (1) and (2) we have uj{Ty,d{v) + 1) < w(T„,i). Q.E.V. 



From Lemmas 2 and 3 we immediately have the following lemma. 

Lemma 4. For any vertex v, u)(Ty) = uj(Ty,d{v) + 1). In particular, to(T) = 

Lu{Tr,d{r) + 1 ). 

Thus our DP table for each vertex v consists of only the d{v) + 1 values 
u>{Ty,i), I <i < d{v) + 1, and hence the size is d{v) + 1. All the values oj{Ty,i) 
for i, d{v) + 2 < i < n, are equal to ui{Ty, d{v) + 1). 

For each leaf v of T, the DP table for v consists of a single value uj{Ty, 1) = 0. 
Note that d{v) = 0 and Ty has no edge. 

Thus it suffices to show how to construct the DP table for an internal vertex 
V from the DP tables for all the children V\,V2, - ■ ■ , Vd(y) of v. (See Figure 2.) We 
first notice that, for each i, I < i < d{v) + 1, T„ has an edge-coloring / such that 
u>{f) = uj{Ty,i), Ci £ Miss(f,v), and C{f,v) consists of the first d{v) colors in C 
other than Cj, as in the following lemma whose proof is omitted in this extended 
abstract. 

Lemma 5. For any vertex v of T and any index 1 < i < d{v) + 1, Ty has 
an edge-coloring f such that u){f) = ui{Ty,i), Ci G Miss(/, u), and C{f,v) = Di, 
where Di = {ck | 1 < fc < d{v) l,k ^ i}. 

We then compute the value w(T„, i) for an index i, 1 < i < d{v) 1, from the 
d{v) X {d{v) 1) values u}{Ty.,k) for all indices j, I < j < d{v), and all indices 

k, 1 < k < d{v) 1. Let J be the set of indices j of all children Vj of v, that is, 

J = {j \ ^ < j ^ c?(u)}. Let 6j = {v, Vj) for each j G J, as illustrated in Figure 2. 

Let 1 < i < d{v) 1, and let Ki be the set of indices k of all colors Cfc in Di, 

that is, 

Ki = {k \ 1 < k < d{v) -I- 1, A: yf i}. 

Suppose that / is an edge-coloring of T„ such that w(/) = uj{Ty,i), Ci G 
Miss(/, v), and C{f,v) = Di. Lemma 5 implies that Ty has such an edge-coloring 
/. For each j £ J, let fj be the restriction of / to the edges in Ty. , that is, fj 
is an edge-coloring of Ty. such that fj{e) = /(e) for every edge e in Ty^ Then 
clearly 

^if) = ^{Ty,i) = ^uj{fj)+uj{Di), 
jeJ 



( 3 ) 
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where 

w(A) = 

c&Di 

Note that the value w(-Di) depends on i, but does not depend on the edge- 
coloring /. For each j G J, we denote by j' the index such that Cj> = f{ej) G Di. 
Since / is an edge-coloring of all colors f{ej) are different from each other. 
Therefore the mapping ip : J ^ such that (p{j) = j' G for each j G J 
is a bijection. Since Cj> G Miss{fj,Vj), the definition of oj{Ty.,j') implies that 
However, we have 

(4) 

as follows: if u>{fj) > uj{Ty.,j'), then Ty. has an edge-coloring /j such that 
Cj' G Miss{fj,Vj) and w(/j) = u}{Ty.,j') < uj{fj), and hence the edge-coloring 
/' of Ty obtained from / by replacing fj with /j satisfies Ci G Miss(/',w) and 
C(/',r’) = T>i but w(/') < w(/) = Lv{Ty,i), a contradiction. By Eqs. (3) and (4) 
we have 

‘^(/) = ‘^{Ty,i) = '^uj{Ty.,j') +u{Di). (5) 

jeJ 

Suppose conversely that (p : J ^ Ki is any bijection. For each j G J, we 
denote p{j) by f , that is, f = p{j), and let gj be any edge-coloring of Ty. such 
that uj{gj) = u>{Ty.,j') and Cji G M.iss{gj,Vj). Let g be an edge-coloring of Ty 
extended from all gj, j G J, as follows: 

(•g) ^ I Cj, if e = {v, Vj) and j G J; 

\ gj{e) if e is an edge of Ty. and j G J. 

Then a G Miss(g,v), C(g,v) = Di, and 

i64 

= ^u;(r„.,/)+a;(A). (6) 

j&J 

Let 



b{i) = imn'^uj{Ty.,j'), (7) 

jeJ 

where the minimum is taken over all bijections p : J ^ Ki. Then one can know 
from Eqs. (5) and (6) that w(g) = ui(Ty,i) if and only if p attains the minimum 
value b{i). 

We thus have the following lemma. 

Lemma 6. For any internal vertex v of T and any index i, 1 < i < d{v) + 1, 

Lo{Ty,i) = b{i) + w(A)- 
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Fig. 3. Complete bipartite graph B{i) = {J,Ki]J x Ki). 



One can easily compute the values uj{Di), I < i < d{v) + 1, in time 0{d{v)). 
We shall therefore show how to compute b{i). 

The problem of computing b{i) can be reduced to the minimum weight per- 
fect matching problem for a complete bipartite graph, as follows. Let B{i) = 
{J,Ki;J X Ki) be a complete bipartite graph with vertex sets J and Ki. (See 
Figure 3.) Note that | J| = \Ki\ = d{v). Let ft, : J x iCj — >• R be a weight function 
such that ft(e) = Lo{Ty.,k) for each edge e = (j,k), j G J and k G Ki. The 
minimum weight perfect matching problem is to find a perfect matching M such 
that the sum of weights of the edges in M is as small as possible [12]. A perfect 
matching in B{i) is indicated by thick lines in Figure 3. Clearly, b{i) is equal to 
the weight of the minimum perfect matching in B(i). 

The minimum weight perfect matching problem can be solved in time 0{d^{v)) 
for a single graph B(i) [12], and hence the value b{i) for an index i, 1 < z < 
d{v) + 1, can be computed in time 0{d^{v)). Therefore the d{v) + 1 values b{i) 
for all indices z, 1 < i < d{v) + 1, can be computed total in time 0{d‘^{v)). 

The third idea is to compute all the d{v) + 1 values b{i), 1 < z < d{v) + 1, to- 
gether. Kao et al. recently showed that if a given complete bipartite graph B has 
d vertices then one can solve in time 0{d^) the minimum weight perfect match- 
ing problems for all d complete bipartite graphs obtained from B by deleting 
each of the d vertices [7,8]. Let AT = {1, 2, • • • , d{v) + 1}, let B = {J,K; J x K) 
be a complete bipartite graph, and let ft(e) = k) for each edge e = (j, ft), 

j £ J and k £ K . Applying their algorithm to B, we can compute all the values 
b{i), 1 < i < d{v) + 1, in time 0{d^{v)). We thus have the following lemma. 

Lemma 7. For any internal vertex v of T, one can construct the DP table for 
V from the DP tables for all the children of v in time 0{d^{v)). 

We can thus compute the minimum cost uj{T) = ui{Tr,d{r) + 1) in the DP 
table for root r in time 
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\vev ) Wvev ) ) 

= 0(nZ\2). 

Clearly, the total size of all DP tables is + 1) = 0{n). We do not 

store the bijection ip found at each internal vertex. 



3.2 Finding an Optimal Edge-Coloring 

We next show how to find an optimal edge-coloring / of T. Although we compute 
the DP tables from leaves to the root, we decide the colors of edges from the 
root to leaves, as follows. 

When we computed 

w(T) = w(Tr, d{r) -k 1) = b{d{r) -k 1) -k uj{Dd(r)+i), 

we found a bijection p : J ^ Kj^(^r)+i attaining the minimum value b{d{r) + 1) 
in Eq. (7), where J = ATj;(r)+i = {j I 1 < J < d{r)}. We find the bijection 
p again. For each j, 1 < j < d{r), we color the edge Cj joining r and its jth 
child rj with G = {ci,C 2 ,--- ,C 4 (^r)}- We hence have C{f,r) = 

{ci, C2, • • • , Cdeg{r)\ since d{r) = deg{r). 

We may thus assume that, for an internal vertex v, the edge joining v and its 
parent u has been colored with a color Ci € C for some index i, 1 < i < deg{u). 
Let i' = i a i < d{v) + 1, and let i' = d{v) -k 1 if i > d{v) + 2. We find again 
the bijection p : J ^ Kii attaining the minimum value b(i') in Eq. (7). For each 
j, 1 < J < d{v), we color the edge ej = (v,Vj) with G D^/. We hence have 
C(/, v) = Di, U {c^} c {ci, C2, • • • , Cdeg{v)} U {cj since d{v) -k 1 = deg{v). 

Thus we can correctly find an optimal edge-coloring of T. Clearly, the algo- 
rithm takes time 0{nA^). This completes a proof of Theorem 1. 

Kao et al. also show that if a given complete bipartite graph B has d vertices 
and all edge weights are integers in the range [-N, A^] then one can solve in 
time 0{d^'^ log(dN)) the minimum weight perfect matching problems for all d 
complete bipartite graphs obtained from B by deleting each of the d vertices 
[7,8]. Using their algorithm, one can find an optimal edge-coloring of tree T in 
time 0{nA^-^ log{AN)) if all the color costs are integers in the range [— A^, A^]. 

We finally give a remark. As we mentioned in Introduction, an optimal edge- 
coloring of a graph G does not always use the minimum number x'i^) of colors. 
However, from the algorithm we immediately have the following theorem for trees 
T. Note that x'('T) = L\(T) and that our algorithm does not use the method of 
interchanging colors in an alternating path although it is used in the proofs of 
Lemmas 2 and 5. 

Theorem 2. Any tree T has an optimal edge-coloring using the first A{T) 
cheapest colors ci, C 2 , • • • , ca in C. 
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Abstract. The problem of counting all iif-colorings of a graph G of n 
vertices is considered. While the problem is, in general, ^P-complete, we 
give linear time algorithms that solve the main variants of this problem 
when the input graph G is a fc-tree or, in the case where G is directed, 
when the underlying graph of G is a fc-tree. Our algorithms remain poly- 
nomial even in the case where fc = O(logn) or in the case where the size 
of H is 0{n). Our results are easy to implement and imply the existence 
of polynomial time algorithms for a series of problems on partial fc-trees 
such as core checking and chromatic polynomial computation. 



1 Introduction 

In this paper we consider a series of counting problems associated with various 
versions of homomorphisms from a graph G to a fixed graph H . Given a graph 
H = (V{H),E{V)), with loops but without multiple edges, for any graph G = 
{V{G),E{G)) an homomorphism of G to i? is a map 6 : V{G) — >■ V{El) with 
the property that {w,ic} G E{G) {9{v),9{w)} G E{H). We will call such 
a homomorphism a El -coloring of G. For it fixed, the H-coloring problem asks 
if there is an it-coloring of G. If H is bipartite or it has a loop, then the it- 
coloring problem can be trivially solved in polynomial time, but in the case that 
it is not-bipartite and loopless the problem is known to be NP-complete [HN90] 
(see also [GHNOO] for the complexity of the same problem for bounded degree 
graphs) . 

An earlier version of the tt-coloring problem is the exact H-coloring problem: 
given a graph G, decide if there is an tt-coloring 6* of G such that 9{V{G)) = 
V{H) and 9{E{G)) = E{H). This problem appeared in [GJ79] as problem GT- 
52 and is known to be NP-complete, even for the case where it is a triangle 
[Lev73]. We also consider the intermediate problem of asking whether there is 
an tt-coloring 0 of G such that 9{V{G)) = V{H) and we call it vertex exact 
H-coloring. 

Given a graph G, let H.{G,H) denote the set of all tt-colorings of G, and 
let S{G,H) denote the set of all exact tt-colorings of G. Also, we denote as 
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of the 3rd author was supported by the Ministry of Education and Culture of Spain, 
Grant number MEC-DGES SB98 0K148809. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 298-307, 2001. 
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V(G, H) the set of all vertex _ff-colorings of G. For a fixed graph H, denote 
by #H -coloring the problem of, given G as input, computing \H{G,H)\. Also, 
denote by #EH -coloring the problem of, given as input G, computing \S{G, H)\ 
and by #V EH -coloring the problem of, given as input G, computing |V(G, H)\. 
Dyer and Greehill [DGOO] have proved that unless every connected component of 
H is an isolated vertex without a loop, a complete graph with all loops present or 
a complete unlooped bipartite graph, the ^;^_ff-coloring problem is #P-complete, 
even when G has bounded degree. It is easy to see that the same result hold for 
the #Ai/-coloring (#Gifi7-coloring) as the knowledge of S{G,H') (V(G, iJ')) 
for any (induced) subgraph H' of H implies the knowledge of TL{G,H). For 
further negative results on the approximability of the ^iJ-coloring see [GDF] 
and [DFJ99]. For positive and negative complexity results concerning special 
cases of the ^j(^i/-coloring problem, depending on restrictions on both H and G, 
see [Edw86, DNSOO]. 

The notion of treewidth appears to play a central role in many areas of graph 
theory. Roughly, a graph has small treewidth if it can be constructed by as- 
sembling small graphs together in a tree structure, namely a tree decomposition 
of small width (see section 2 for the formal definitions). It was first defined by 
Robertson and Seymour and served as one of the cornerstones of their lengthy 
proof of the Wagner conjecture, known now as the Graph Minors Theorem (for 
a survey see [RS85]). Treewidth appears to have interesting applications in al- 
gorithmic graph theory. In particular, a wide range of, otherwise intractable, 
combinatorial problems are polynomially, even linearly, solvable when restricted 
to graphs with bounded treewidth (alternatively, graphs with bounded treewidth 
are called partial k-trees). Several general results have been developed for prov- 
ing, constructively, the existence of such algorithms. Among them, we mention 
the work of Gourcelle in [Gou90b] where the existence of polynomial algorithms 
is guarantied by the expressability of the corresponding problem by Monadic 
Second Order formulas (see also [Gou90a]). Similar results have been developed 
in [GM93] for solving counting problems for partial k-trees (see also [GMR]). 
As a consequence of these results, it is possible to contruct a polynomial time 
algorithm solving the -coloring problem for partial fc-trees when k and the 
size of H are fixed constants. Unfortunately, the general results of Gourcelle, 
do not provide implementable algorithms because of the immense hidden con- 
stants in their complexity. Alternatively, the attention of many researchers was 
fixed on developing fast and easy to implement “tailor made” algorithms for 
problems of specific interest. The standard methodology to get such solutions 
consists of a two step procedure: First to find a tree-decomposition of the input 
graph, with treewidth bounded by a constant, although possibly not optimal, 
and second to use some kind of dynamic programming taking advantage of the 
bounded treewidth decomposition of the graph, to get a solution (for a sur- 
vey, see [Bod97]). In particular, such a dynamic programing technique, solving - 
among others - the Ff-coloring problem for partial fc-trees, has been developed 
in [TP97] (also, see [JS97], for counting list colorings and other generalizations 
of colorings). For the first canonical step, Bodlaender in [Bod96] presented, for 
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any k, a linear time algorithm that, given a graph G, checks whether G is a 
partial fc-tree and, if so, outputs an mimimum width tree decomposition. As this 
algorithm appears extremely hard to be implemented (see also [BK96]), the first 
canonical step can be carried out by one of the previously developoed algorithms 
that are able to output a tree-decomposition of G with treewidth bounded by 
a constant (linear in k) [Ree92]. In particular, the deterministic algorithm by 
Reed [Ree92] runs in 0(n log n) time, which, for constant fc, gives a 2 approx- 
imation to the optimal treewidth-decomposition. Moreover, its implementation 
is easier than the algorithm in [Bod96]. Consequently, we will assume that any 
partial fc-tree is always accompanied with its linear size tree decomposition of 
bounded width. 

In this paper we present a polynomial time algorithm for the counting prob- 
lems #i?-coloring, ^EH -color \\ig, and EH -coloring, in the case that the 

input graph G has constant treewidth. Although our methodology follows the 
two cannonical steps, we shall remark that we present an easy to be imple- 
mented algorithm for the second step, asumming the aforementioned results on 
the existence of fast and implementable algorithms for obtaining a bounded 
treewidth-decomposition for a partial fc-tree G. Finally, we stress out that our 
results are the first “tailor made” positive results on counting iJ-colorings for 
the case where H is generic and G is restricted. 

In Section 2 we review the basic definitions and some preliminary results 
on treewith. In Section 3 we present the algorithm solving the ^iJ-coloring 
problem. In Section 4 we present two variants of the algorithm of Section 3, for 
solving the EH -coloring and the ^EH -coloring problem respectively. In the 

final section, we present some consequences of our previous results for a series 
of problems on graphs with bounded treewidth. 



2 Definitions and Basic Resnlts 

Robertson and Seymour introduced the notion of treewidth in the framework of 
the theory of graph minors. 

A tree decomposition of a graph G is a pair {X, U) where C/ is a tree whose 
vertices we will call nodes and X = {{Xi \ i G V{U)}) is a collection of subsets 
of R(G) such that 

1- Uiev(c/) “ ^(^)> 

2. for each edge {u,w} G E{G), there is an i G V{U) such that v,w G Xi, and 

3. for each v G V{G) the set of nodes {i \ v G Xi} forms a subtree of U. 

The width of a tree decomposition {{Xi \ i G V{U)}, U) equals msoci^v 

1}. The treewidth of a graph G is the minimum width over all tree decompositions 

of G. Computing treewidth is an NP-complete problem [ACP93]. 

We now define some extensions of the definition of a tree decomposition 
were introduced by Boadlaender and Kloks [Bod96] (see also [Arn85]). A rooted 
tree decomposition is a triple D = {X,U,r) in which G is a tree rooted at r and 
{X, U) is a tree decomposition. Let D = {X, U, r) be a rooted tree decomposition 
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of a graph G where X = {Xi \ i &V{U)}. D is called a nice tree decomposition 
if the following are satisfied: 

1. Every node of U has at most two children, 

2. if a node i has two children j, h then Xi = Xj = Xh, 

3. if a node i has one child, then either \Xi\ = |Xj| + 1 and Xj C Xi or 

|Xi| = \Xj\ - 1 and Xi C Xj. 

The following lemma can be found in [BK96] . 

Lemma 1. For any constant k > 1, given a tree decomposition of a graph G of 
width < k and 0{n) nodes, where n is the number of vertices in G, there exists 
an 0{n) algorithm that constructs a nice tree decomposition of G of width less 
or equal than k and with at most 0(n) nodes. 

We observe that a nice tree decomposition D = {{Xp \ p G V{U)},U,r) 
contains nodes of the following four possible types. A node p G V {U) is called: 
Start, if p is a leaf. Join, if p has two children qi,i = 1,2, Forget, if p has 
only one child q and \Xp\ = \Xq\ — 1, Introduce, if p has only one child q and 
|Xj,| = |X,| + l. 

Notice that for every start node we may assume that \Xp\ = 1: the effect 
of start nodes with \Xp\ > 2 can be obtained by using a start node with a set 
containing one vertex, and then \Xp \ — 1 introduce nodes, which add all the other 
vertices. For the purposes of this paper we will assume that the root r is a forget 
node with X^ = 0. Taking in mind Lemma 1, we will assume that any partial 
k-tree is given along with a nice tree decomposition of it. 

Let D = {X, U, r) be a nice tree decomposition of a graph G. For each node 
p of D, let Up be the subtree of U, rooted at node p. We set Vp = yiv^v{Up)^v 
and, for any p G V{U), we define Gp = G[V^]. Notice that Gr = G. 

3 An Algorithm for Counting the Number of i?-Colorings 

Let H he & fixed graph. Given an input graph G together with a nice tree 
decomposition D = {X, U, r) of it, we wish to count \H{G, H)\, i.e. to solve the 
^iL-coloring problem for G. 

The algorithm will do a traversal of D computing and tabulating for each p G 
V{U) the appropriate information. Before describing it, we need some additional 
definitions. 

Given a nice tree decomposition D = {X, U, r) of a graph G, for each p G 
V{U), let Fp = {if ■. Xp -G V{H)}. Notice that if G has treewidth k, then for 
any p G V{U), \Fp\ < Moreover, for the root we have F^ = {0}. 

The table associated to a node p GV (U) will have an entry for each (p G Fp, 
holding the value Ip{p) = \{0 G "H(Gp,iL) | 0\xp = p}\ (in general, if 0 is a 
function, we denote 0|s = {(u, a) G 0 \ v G S'}). As we always have 9\(j = 0, we 
get \H{G,H)\ = A(0). 

The function count-H takes as input G, D and a topological ordering S = 
(pi, . . . ,ps) of V{U) and outputs the value lr{0). 
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function count-H(G, i7, D, 5) 

D = {X, U, r) is a nice tree decomposition of G with width k 
n = \V(G)\,h = \V{H)\,s=\V{U)\ 

S = {pi, ■ ■ ■ ,Ps) is a topological ordering of V{U) 

begin 

for p := pi to ps do 

if p is a start node, with Xp = {w} then 
for all a € V (H) do Ip{{v,a)) := 1 

end 

if p is a introduce node then 
let q be its unique child 

V 1= Xp 

:= {li 6 I 6 E{Gp)} 
for all (fi €. Fq and a € V{H) do 
if V„gs,{ip(ii),a} 6 E{H) then 

Ip{(pU {{v,a)}) := Iq{(p) 

else Ip{<p U {(«, a)}) := 0 

end 
end for 

erase the information on node q 

end 

if is a forget node then 
let q be its unique child 

V 1 = Xp 

for all (f € Fp do 

fp(v) ^2aeV(H) U {('*^5 ®)}) 

end for 

erase the information on node q 

end 

if p is a join node with children qi, and q2 then 
for all (f € Fp do 

Ip{(p) := Iq,{(p) ■ Iq^{(p) 

end for 

erase the information on nodes qi and q2 

end 
end for 
return lr{0) 

end 

Theorem 1. Given a graph G with n = |F(G)|, a nice tree decomposition 
D = {X, U, r) of G with width k, and a topological ordering of the nodes in 
D. Then there exists an algorithm that computes the number of H-colorings 
of G in 0{nh^^^ Tain{k,hY) steps using logn) additional space, where 

h=\V{H)\. 

Notice that for constant treewidth k, we can allow h = 0(n) and still 
have a polynomial time algorithm. Taking in mind the algorithm of Bodlaender 
in [Bod96] we have the following. 

Corollary 1. Given a graph G if it has treewidth bounded by a constant k, then 
we can compute \TL{G,H)\ in 0{nh^~^^ mm{k, h}) + 0{f{k)n) steps. 
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For constant k and h, our algorithm works in linear time and therefore we 
have the following. 

Corollary 2. The #H-coloring problem, with parameters the treewidth of the 
input and the size of H, is a fixed parameter tractable problem (i.e. belongs to 
the class FPT). 

By suitably altering function count-H we can have the analogous of Theorem 1 
and Corrolary 1 for the ffV EH -coloring and the ffEH-coloring problems. 

Theorem 2. Given a graph G with n = |y(G)|, a nice tree decomposition D = 
{X, U,r) of G with width k, and a topological ordering of the nodes in D. Then 
there exists an algorithm that computes the number of vertex exact H-colorings 
of G in max{fc, 2^^}) time using log n) additional space, 

where h = \V{H)\. 

Corollary 3. Given a graph G with n vertices and treewidth bounded by a con- 
stant k, then we can compute \V{G,H)\ in 0(n/i^+^2^ max{/c, 2^^}) + 0(/(fc)n) 
time, where h= \V{H)\. 

Theorem 3. Given a graph G with n = |C(G)|, a nice tree decomposition 
D = {X, U, r) of G with width k, and a topological ordering of the nodes in 
D. Then there exists an algorithm that computes the number of H-colorings 
of G in 0(n/i*+^2^+® max{A:, time using log n) additional 

space, where h = \V{H)\ and e = \E{H)\. 

Corollary 4. Given a graph G with n vertices and treewidth bounded by a 
constant k, then we can compute \S{G,H)\ in 0(n/i^+^2'*+® max{A:, 2^^^+®)} + 
0{f{k)n) time, where h = \V{H)\ and e= \E{H)\. 

Notice that for each of the algorithms given so far, if we retain the infor- 
mation of all the tables, it is possible to use it in order to enumerate all the 
homomorphisms. The storing of all the tables implies a burden of 0(n/logn) to 
the space reported in Theorems 1, 2, and 3. In particular, setting up a suitable 
bookkeeping of the enumerated homomorphisms, a top-down traversal of the 
table information can pop-up each of them in 0(n) steps. 

We point out that in all algorithms of this section and for constant h, we 
can allow k = O(logn) and still have polynomial time. Finally, all the results 
of this section can be adapted straighforwardly to solve the directed versions of 
the problems {H and G are directed graphs). 

4 Extensions to Other Resnlts 

In this section, we present some consequences of the polynomial time algorithms 
given in the previous sections. For each problem, either we give a new polynomial- 
time result or we improve previously known results. Recall that if we are given a 
graph G, directed or undirected, all of our algorithms should be used in conjuc- 
tion with one of the algorithms mentioned in the introduction, that in polynomial 
time, compute the treewidth decomposition of bounded width. 
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4.1 Coloring Problems 

Notice that a graph G is iJ-colorable if |'H(G, H) \ > 1. This implies the following 
corollary of Theorems 1, 2, and 3 for the corresponding decision version of the 
problems examined. 

Corollary 5. For any graph H with h vertices and e edges, we can solve, for 
an n-vertex partial k-tree as input graph, 

— the F[ -coloring problem in 0{nh^~^^ minj/c, h}) steps, 

— the vertex exact F[ -coloring problem in max{fc, 2^}) steps, and 

— the exact H -coloring problem in max{A:, 2^+'^}) steps. 

Corollary 5 improves the time of the best known algorithm for the iJ-coloring 
of partial /c-trees. This algorithm runs in steps and is due to Telle 

and Proskurowski in [TP97]. 

We denote as K,, the complete graph with c vertices. As the problem of 
counting the number of c-colorings of a graph G is equivalent to the ^ATc“Coloring 
problem we can conclude to the following. 

Corollary 6. An algorithm can be constructed to compute the number of c- 
colorings of a partial k-tree of n vertices in 0{nc^~^^ min{k,c}) steps. 

We denote as /3g{c) the function mapping c to the number of colorings of G 
that use exactly c colors, i.e. Pg{c) = |V(G, A'c)|- 

Lemma 2. For any graph G, Pg{c) = \n{G,Kc) \ - J2i<r<c-i O^oir). 

The formula of Lemma 2 and Corollary 6 give a way to compute jdGic). 

Lemma 3. An algorithm can be constructed that, given partial k-tree G with n 
vertices and for any c,l < c < n, computes /3 g{c) in 0{nc^^^ min{fc, c}) steps. 

Corollary 7. An algorithm can be constructed that computes the chromatic 
polynomial of a partial k-tree with n vertices in 0{kn^^^) steps. 

In our knowledge, the best algorithm for computing the chromatic polymonial 
for partial fc-trees can be derived from the algorithm of Andrzej ak [And98]. This 
algorithm can compute in steps the Tutte polynomial of a partial 

/c-tree of n vertices, where c is twice the number of partitions of a set with 
3(/c + 1) elements. Several researchers have considered the problem of counting 
the number of proper c-colorings in a graph G. The problem is ^P-hard for 
c > 3 and maximum degree A > 3 [Jer95]. In the same paper, it is proved 
that there exists a Fully Polynomial time Randomized Aproximation Scheme 
(FPRAS) for the number of colorings in the case when c > 2 A -\- 1. Recently, 
Bubley et all. in [BDGJ99] proved that the problem is ^j^P-hard for fixed A, but 
there is a FPRAS for c = 5 and A = 3. Edwards [Edw86] proved that if c > 3 
and the minimum degree where 5 > an, n = |P(G)|, the counting problem is 
#P-complete if a < but it is in P for a > 
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4.2 Problems on Cores 

Given a graph G, a core of G is a subgraph G of G such that G is homomorphic 
to G, but G fails to be homomorphic to any proper subgraph of G. This notion 
of core is due to Hell and Nesetfil [HN92]. A graph G is a core if G is a core for 
itself or, equivalently, if \£{G,G)\ = \H{G, G)|. For example, any graph G 2 k+i is 
a core (G 2 fc+i is the connected 2-regular graph with 2fc-|- 1 vertices). It is known, 
that in general, the problem of deciding whether G is a core is NP-complete 
[HN92] and in the same paper it is presented a polynomial time algorithm, to 
decide if G is a core, for the particular case when G has independence number 
a{G) < 2. Finally, we mention that “almost all graphs” are cores [Nes99]. 

We first prove the following. 

Corollary 8. An algorithm can he constructed that decides, in steps, 

whether a partial k-tree with n vertices is a core. 

Proof. Notice that it is enough to check whether there exists a subgraph H 
of G where \E{H)\ = |A(G)| — 1 and such that \'H{G,H)\ > 1. According to 
Theorem 1, this requires 0(|if(G) = 0{k^n^^^) steps (take in mind that 

if G is a partial fc-tree then E{G) = 0(fc|C(G)|)). 

We mention that the property of being a core is an EMS-property (i.e. in- 
volves counting or summing evaluations over sets definable on monadic second 
order logic) and therefore, the results in [Cou90b, ALS91] imply the existence 
of a polynomial time algorithm for deciding it. So far, no explicit algorithm has 
been reported for this problem. 

We consider the isomorphism problem on cores. Notice that graphs G and El 
are isomorphic iff \£{G, H) \ ■ \£{H, G) | > 1. This check can be done in polynomial 
time when both G and H are cores and partial k-trees. 

Corollary 9. An algorithm can he constructed that checks, in 0{kn^^^) steps, if 
two cores G and H of treewidth < k are isomorphic (n = max{|l/(G)|, \V{H)\}). 

In general, checking whether two partial fc-trees are isomorphic can be done 
in steps [Bod90]. Corollaries 8 and 9 improve this time for the case 

where some of the input graphs is a core. 

Using the idea of Corollary 9, we can count automorphisms of cores with 
bounded treewidth. 

Corollary 10. An algorithm can be constructed that outputs, in 0{kn^^^) steps, 
the number of automorphisms of a core G of n vertices and treewidth < k. 

A graph G is called rigid when the identity is the only homomorphism in 
%{G,G) [Nes99]. We conclude to the following. 

Corollary 11. An algorithm can he constructed that checks, in 0{kn^'^‘^), steps 
whether a n-vertex partial k-tree is rigid or not. 




306 



Josep Diaz, Maria Serna, and Dimitrios M. Thilikos 



4.3 Problems on Counting Independent Sets 

In the case where H is the unique graph with two vertices one edge and one 
loop, |'H(G,i7)| is the number of independent sets of G. We have the following. 

Corollary 12. An algorithm can he constructed that outputs the number of in- 
dependent sets of a n-vertex partial k-tree in 0{n2^^^) steps. 

Notice that the linear algorithm of Corollary 12 remains polynomial even if 
the treewidth of G is bounded by 0(log n). We could remark that, by specializing 
the structure of H, we can generate an arbitrary number of counting problems 
that are, in general, ^P-complete. As an example we mention the problem of 
counting the g-particle Widom-Rowlinson configurations in bounded treewidth 
graphs [DGOO]. 

Suppose that iJ is a star with a loop attached on its unique internal vertex 
(a star is a tree of diameter 2). Then, asking whether |V(G, H)\ > 1 is equivalent 
to ask whether G has an independent set of size > k. Finally, Theorem 2 yields 
the following. 

Corollary 13. An algorithm can he constructed that for any partial k-tree of n 
vertices and any r, outputs the number of the independent sets of G with at least 
r vertices in 0(n(r + max{fc, 2^*^’’+^^}) steps. 
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Abstract. We give 0(m + n) algorithms for enumerating all minimum 
separators as well as minimal separators in a connected chordal graph. 
We give a tight upper bound, n — k{G) — 1, for the number of minimal 
separators in a chordal graph. An important contribution of the paper is 
our characterisation of minimal (minimum) separators of chordal graphs 
and the above results are obtained as consequences of the characterisa- 
tions. 



1 Introduction 

Let (7 be a cycle in a graph G. A chord of C is an edge of G joining two vertices 
of G which are not consecutive. A graph G is called a chordal (or triangulated) 
graph iff every cycle in G of length 4 or more has a chord. Chordal graphs arise 
in many applications (see [6,15,20]). Chordal graphs constitute one of the most 
important subclasses of perfect graphs [6]. 

In a connected graph G, a separator S' is a subset of vertices whose removal 
separates G into at least two connected components. S is called a (a—b) separator 
iff it disconnects vertices a and b. A {a — b) separator is said to be a minimal 
separator iS it does not contain any other (a—b) separator. A minimum separator 
is a separator with minimum size. Clearly every minimum separator is a minimal 
separator. The (vertex) connectivity, k(G) is defined to be the size of a minimum 
separator. The connectivity of a complete graph is by definition jCj — 1. It doesn’t 
have any separators. For basic notions about connectivity, we refer to [7,6]. 

The problem of listing all minimal separators and minimum separators is 
one of the fundamental enumeration problems in graph theory, which has great 
practical importance in reliability analysis for networks and operations research 
for scheduling problems [9,5,1]. 

The related problem of finding the connectivity of a graph is conjectured to 
have an 0{m + n) algorithm (see [4] for history). The interest in finding better 
or different algorithms is still very much alive [19,16,17,3,22,18,4]. The latest 
and the fastest in the sequence is due to Gabow [4], which runs in 0{n^k + 
nk{min{k"i , kn*))) time, where k = n{G). In this paper, we present an O(m-l-n) 
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algorithm, restricted to the class of chordal graphs, which not only finds the 
connectivity, but lists all minimum separators. 

The problem of listing all minimum or minimal separators of an undirected 
graph is considered by various authors [5,9,10,8]. In [9], a 0(2^n^) algorithm 
is given for listing all minimum separators. A 0{n^Rs) algorithm is given is 
[10], to list all minimal separators, where Re is the total number of minimal 
separators in the graph. This is improved in [8] to 0{n^R^ + n'^Rs), where 
Re — {'n{n — l)/2 — m)RE- Restricted to the class of chordal graphs, we present 
algorithms that lists all minimal (or minimum) separators in 0(m + n) time, 
which compares sharply with the general case running times available in [9,10,8]. 
Algorithms to enumerate the minimal separators for some other subclasses of 
perfect graphs (eg. permutation graphs) are given in [12,11]. 

As an auxiliary result we show that the number of minimal separators in a 
chordal graph is upper bounded by n — k(G) — 1. We also show the tightness of 
this upper bound by constructing a graph on n nodes with number of minimum 
separators = number of minimal separators = n — k{G) — 1. Some other impor- 
tant classes of graphs which are known to have polynomial number of minimal 
separators include permutation graphs, circular permutation graphs, trapezoidal 
graphs, circle graphs, circular arc graphs, distance hereditary graphs, chordal bi- 
partite graphs, cocomparability graphs of bounded dimension and weakly trian- 
gulated graphs [11,12,13,14]. Note that in general graphs, the number of minimal 
separators can be exponential [10]. In [9] it is shown that in a general graph, the 
number of minimum separators is bounded by 0(2^^), where k = k{G). For 
information on why number of minimal (or minimum) separators are important 
see [10,12]. 

An important contribution of this paper is the structural understanding of 
separators of chordal graphs in general and the minimum and minimal separators 
in particular (Lemma 5, Theorems 1, 2). As far as we know, these characteri- 
sations are new. Also we haven’t seen any algorithm for listing the minimal 
separators of a chordal graph elsewhere. But we point out that an algorithm 
which runs in worse than linear time (eg 0{n^{m + n))) may be designed with 
weaker understanding of the separator structure. The upper bound (Theorem 
5) can be obtained in other ways also, though we haven’t seen it mentioned 
anywhere else. 

The only fact about chordal graphs, the reader should be familiar with, for 
reading this paper, apart from the notion of PEO, (which will be explained in 
the next subsection) is a well known characterisation due to Dirac [2,6]. 

Lemma 1. A graph G is a chordal graph if and only if every minimal separator 
of G induces a complete subgraph. 

1.1 PEO, Some Definitions, and Notations 

Let G = (y, E) be a connected undirected graph. Throughout this paper we will 
use V for the set of vertices of G and E for the set of edges. |y| and \E\ will be 
denoted by n and m respectively. N{v) will denote the set of neighbours of v, 
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that is N{v) = {u £ V : (u,v) £ E}. For A C y, we use N{A) to denote the set 
UjjsA — A. The subgraph of G induced by the nodes in A will be denoted 
by G{A). 

A bijection f : V — >■ {1, 2, • • • , n} is called an ordering of the vertices of G. 
Then f(v) is referred to as the number associated with the vertex v, or simply 
the number of v with respect to the ordering /. Given an ordering / of a graph 
G, we define the following terms. 

Definition 1. Let A C V. The highest(A) is defined to be the vertex with the 
highest number in A. Similarly lowest(A) is the vertex in A with the lowest 
number. 



Definition 2. A path P = {w\,W 2 , • • • , Wk) in G is called an increasing path, iff 
f{wi) < f{w 2 ) < ■ ■ ■ < f{wk). It is called a decreasing path iff f{wi) > f{w 2 ) > 

■ ■ ■ > f{wk). A single node can be considered as either increasing or decreasing. 



Definition 3. A vertex u G N{v) is called a higher neighbour of v iff f{u) > 
f{v). The set of higher neighbours of v will be denoted by Nh{v), ie 

Nh{v) = {u£ N{v) : f{u) > f{v)} 

Similarly the set of lower neighbours of v is denoted by Ni(v). 

Ni{v) = {u£ N{v) : f{u) < f{v)} 
dh{v) = |fV/i(u)| and di{v) = |iVi(u)|. 

Definition 4. An ordering f ofGis called a perfect elimination ordering (PEO) 
iff for each v £V, G({u} IJ is a complete subgraph (clique) of G. Then 

f{v) will be called the PEO number of v. 

Note that every graph may not have a PEO. In fact it is known that a graph 
G is chordal if and only if there exists a PEO for G [6]. A 0{m + n) algorithm 
to construct a PEO for a chordal graph G is given in [21]. Note that there can 
be more than one PEO for a given chordal graph. The observations presented 
in this paper are valid with respect to any PEO. Therefore we just use PEO{v) 
to denote the PEO number of u. It will be clear from the context which PEO is 
being referred to. 

From now on we are only interested in chordal graphs. We will be using G 
to denote an arbitrary connected chordal graph. We close this subsection with 
one more definition. 

Definition 5. A vertex v is called an interior vertex of G, iff there exists a 
vertex u ^ N{v) with PEO{u) > PEO{v). Note that v is an interior vertex if 
and only if dh{v) < n — PEO{v). We denote the set of interior vertices by V . 
That is, V' = {v £ V : dh{v) < n — PEO{v)}. 
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2 The Minimum Separators 

A chordless path from m to v is defined to be a path from u to t; in G such that 
no two non consecutive nodes of the path are adjacent. The reader can easily 
verify that if there is a path between u and v then there is a chordless path also. 
For example, the shortest path between u and v has to be a chordless path. 

Lemma 2 . Let P = {wi,W2, ■ ■ ■ ,Wk) be a chordless path in a chordal graph 
G and let Wi = highest(P). Then {wi,W2, ■ ■ ■ ,Wi) is an increasing path while 
{wi, Wi+i, ■ ■ ■ ,Wk) is a decreasing path. 

Proof: Consider first the path {wi,W2, ■ ■ ■ ,Wi). Since Wi = highest(P), this 
cannot be a decreasing path. If f < 2 , obviously this is an increasing path. 
Let f > 2 . If {wi,W2, ■ ■ ■ ,Wi) is not an increasing path, then there should be a 
node Wj € P, I < j < i, such that PEO{wj) < PEO{wj+\) and PEO{wj) < 
PEO{wj-i). Then {wj-i,Wj^i} C Nfi{wj). By definition ofPEO, Nh{wj) forms 
a clique, and we have (wj-i,Wj+i) € E. So {wj-\,Wj+i) forms a chord for P, 
contradicting the chordless property of P. We infer that {wi,W2, ■ ■ ■ ,Wi) is an 
increasing path. A similar argument shows that (wj, Wi+i, • • • ,Wk) should be a 
decreasing path. □ 

Corollary 1 . Let P={wi,W2, - ■ ■ ,Wk) he a chordless path and Wk=highest{P). 
Then P is an increasing path. 

Corollary 2 . Let P = {w\,W2, - ■ ■ ,Wk) he a chordless path and PEO{w\) < 
PEO{wk). Then PEO{w2) > PEO{wi). 

Lemma 3 . Let P = {w\,W2, ■ ■ ■ ,Wk) be an increasing path in a chordal graph. 
Let PEO{u) > PEO{wk) where u £ N{wi). Then u € Nh{wk)- 

Proof: We prove the lemma by induction on |P|, the number of nodes in P. 
For |P| = 1, the lemma is trivial. Assume that for all increasing paths with |P| = 
k — 1 , where A: > 1, the lemma is true. Let P = (wi,W2, ■ ■ ■ , Wk) he an increasing 
path with |P| = k. Note that we have PEO{u) > PEO{wk) > PEO{wk-i) since 
P is an increasing path. Applying the induction assumption on the (fc— 1)— node 
path {wi,W2, - ■ ■ ,Wk-i), we get u G Nh{wk-i). Now {wk,u} C Nh{wk-i) and 
by definition of PEO, Nh{wk-i) is a clique, and (u,Wk) £ E. Remembering 
PEO{u) > PEO{wk), we conclude u £ Nk{wk). □ 

Lemma 4 . Let A C V such that G{A) is connected. Let x = highest(A) and 
z = lowest{N (A)) . Then if PEO {x) < PEO{z), N{A) = Nh{x). 

Proof : First note that since x = highest(A), Nh{x) {^A = (f and therefore, 
Nh{x) C N{A). Now we will prove N{A) C Nk{x), from which we can conclude 
N{A) = Nh{x). Let y G N{A). Then there is a m G A such that y G N{w). 
Also PEO{y) > PEO{z) > PEO{x). Consider a chordless path P = {w, • • • , a;), 
which is completely in G{A). Such a path exists, since G{A) is connected. Also 
since x = highest{P), by corollary 1 , P is an increasing path. Then by lemma 3 , 
y G Nfi{x). Thus we conclude that N{A) C Nh{x). It follows that N{A) = Nh{x). 
□ 
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Lemma 5. Let x be an interior vertex in a chordal graph G, ie dh{x) < n — 
PEO{x). Then Nh{x) is a separator of G. In particular, Nh{x) is a {x — y) 
separator for all y ^ Nh{x) with PEO{y) > PEO{x). 

Proof: Let PEO{x) = t. Consider the induced subgraph G(Vt) on the set of 
nodes Vt = {u & V : PEO{u) < t}. Let Gi be the connected component of 
G{Vt) which contains x. Obviously x = highest{Gi). Since N{Ci) C V — Vt, 
clearly every node z G N{Ci) has PEO{z) > t = PEO{x). Now by lemma 4, 
N{Ci) = Nh{x). Since dh{x) < n — PEO{x), there should be a node y such 
that PEO{y) > PEO{x) and y ^ Nh{x). Clearly y ^ Gi since x = highest{Gi). 
Therefore Nh{x) separates x from y, from which the lemma follows. □ 

Lemma 6. Let k{G) be the connectivity of a chordal graph G. Then the number 
of interior nodes, \V'\ is at most n — k{G) — 1. 

Proof: Let x be an interior node of G. Note that by lemma 5, Nh{x) is 
a {x — y) separator for some y with PEO{y) > PEO{x). Then if PEO{x) > 
n — k{G), |iVft(x)| < k{G), which contradicts the definition of k{G). Thus we 
infer that PEO{x) < n — k{G) — 1. It follows that \V'\ < n — k{G) — 1. □ 

Theorem 1. (Characterisation of minimum separators) Let G be a 

chordal graph with k{G) = k. Let M be the set of all minimum sized separa- 
tors of G. Then 

M = {S C V : S = Nh{x) for some x G V' such that dh{x) = k} 

where V = {u G V : dh{u) < n — PEO{u)}, the set of interior nodes of G. 

Proof: First we prove that every S' G M is indeed a minimum separator of 
G. By lemma 5, S is a separator of G. Since we have k = dh{x) = |iV/j(a;)| = |S|, 
S is a minimum separator, since k{G) = k. 

Now let S be a minimum separator, ie |S| = k. Also let V be partitioned 
into A, S and B by S. Let x = highest(A) and let y = highest(B). Without loss 
of generality let PEO{x) < PEO{y). Then x is an interior vertex of G, since 
y ^ N{x) and PEO{y) > PEO{x). By lemma 5, Nh{x) is a separator. Also since 
k{G) = k, |N/j(a:)| > k. Clearly Nh{x) C S, since x = highest{A). Then, since 
|S| = k, we conclude that Nh{x) = S, and dh{x) = k. Thus S G M. □. 

Corollary 3. Let G be a chordal graph, which is not a clique. Then k{G) = 
minuev’dh{u) , where V = {u G V \ dh{u) < n — PEO{u)}, the set of interior 
nodes. 

Algorithm to enumerate minimum separators 

Input : A connected chordal graph G, which is not complete. 

Step 1: Compute a PEO of G. 

Step 2: Find dh{u) for each uGV. 

Step 3: Let V' = {u : dh{u) < n — PEO{u)}. 

Step 4: k(G) = minuev'dhiu). 
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Step 5: For each u G V , if dh{u) = k{G) output Nh{u). 

It can easily be verified that the above algorithm runs in 0{m + n) time. 
Computing the PEO takes 0{m+n) time [21]. Steps 2 and 5 can be implemented 
in 0{m) time, since it only involves traversing the neighbour lists of each node. 
Steps 3 and 4 takes 0{n) time. 

The correctness of step 4 is ensured by corollary 3 and the correctness of 
step 5, namely the sets output are exactly the minimum separators is ensured 
by Theorem 1. 

3 The Minimal Separators 

Theorem 2. (Characterisation of minimal separators) S is a minimal 
separator of a chordal graph G if and only if there exist two nonadjacent vertices 
a,b G V, such that PEO{a) < PEO{b), and S = Nh{a) = Nh{a) P| N{b). 

Proof: First assume that there are two nonadjacent nodes a, b with PEO{a) 
< PEO{b), such that S = Nh{a) = Nh{a) f^N{b). Clearly a is an interior node 
and S = Nh(a) is a separator which separates a from 6, by lemma 5. Clearly S 
is minimal since S = Nh{a) p| A^(6) : removal of any proper subset of S leaves 
out a path from a to b. 

Next, let S' be a minimal separator which separates vi from V 2 - Let A and B 
be the connected components of G(P — S), which contains v\ and V 2 respectively. 
First we show that there exists a node a G A and a node b G B such that 
S = N{a)[^N{b). 

First note that S = N{A) = N{B). To see this, observe that because of 
the minimality of S, S C N{A). Clearly since A is a connected component of 
G{V — S), N{A) C S, also. Thus we have S = N{A). By similar arguments 
S = N{B). 

Let 0 = lowest{S), and x = highest(A). If PEO{x) < PEO{z), then since 
N{A) = S, by lemma 4, we have S = iV(A) = Nh{x) C N{x). In this case let 
a = X. 

If PEO{x) > PEO{z), consider a chordless path from ^ to x, which is com- 
pletely in the induced subgraph G{A[_}{z}). (Observe that such a path exists, 
because G{A[J{z}) is connected since z G N(A)). Let u be the second node in 
this chordless path starting from z. By corollary 2, PEO{u) > PEO{z). Also 
by lemma 1, S forms a complete subgraph of G. Thus we have S — {z} lj{u} C 
Nh{z), since z = lowest{S). By definition of PEO, Nh{z) lJ{-2^} form a complete 
subgraph and thus we have S C N{u). In this case let a = u. 

By similar arguments as above, we get a node b G B such that S C N{b). 
Now we show that S = N{a)f]N{b). Clearly S C N{a)f]N{b). Suppose there 
is a node v G {N{a) fj N{b) — S). Then S can not separate a from b, which is a 
contradiction. Thus we have S = N{a)f^N{b). 

Without loss of generality, assume that PEO{a) < PEO{b). We will now 
show that S = Nh{a) P| N{b). First we claim that PEO{a) < PEO{z). Otherwise 
let PEO{b) > PEO{a) > PEO{z). Note that since a and b are neighbours of 
every node in S, we have a,b G Nh{z). By definition of PEO, there will be an 
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edge between a and b which is a contradiction since S separates a from b. Thus 
PEO{a) < PEO{z), and we have S C Nh{a). It follows that S = Nh{a) f]N{b). 

Finally, since we have established PEO{a) < PEO{z), it follows that a = x = 
highest(A) . (Remember we chose a, from two choices: one was x, the highest(A) 
and the other was u, the second node in the chordless path from z to x. In the 
latter case, PEO{a) > PEO{z) ). Thus we have S = Nh{a). 

□ 

The above theorem says that every minimal separator has to be Nh(u) for 
some interior vertex. Our algorithm to enumerate all minimal separators ex- 
amines Nh{u) for each interior vertex u, and decides whether it is a minimal 
separator or not. We consider a node w to be a witness for Nh{u), if the ex- 
istence of w proves that Nh{u) is a minimal separator. For example, if we can 
find a node w, nonadjacent to u with PEO{w) > PEO{u) and N{w) A Nh{u), 
then by Theorem 2, u; is a witness for Nh{u). Thus the issue in designing the 
algorithm is to efficiently identify a witness, if one exists. 

In our algorithm, we make use of only two types of witnesses. These types 
are defined in terms of a special node in Nh{u), namely z = lowest{Nh{u)). 

Definition 6. First type witness : A node w is defined to be a first type witness 
for Nh{u), iff w G Nh{z) - N{u). 

Lemma 7. (First type witnesses are indeed witnesses) If there exists a 
node w € Nh{z) — N{u), then Nh{u) is a minimal separator. 

Proof: First note that w G Nh{z) — N{u) is not adjacent to u and since 
w £ Nh{z), PEO{w) > PEO{u). Now note that since Nh{u) is a clique, and 
z = lowest{Nh{u)), we have Nh{u) — {z} C Nh{z). Also w G Nh{z). Since by 
definition of PEO, {z}\JNh{z) forms a clique, it follows that Nh{u) C N{w). 
Then by Theorem 2, w is a witness for Nh{u). □ 

Definition 7. Second type witness: A node w ^ u, is defined to be a second type 
witness iff w £ Nfz) — N{u), and Nh{w) = Nh{u). 

Lemma 8. (Second type witnesses are indeed witnesses) If there exists 
w ^ u, w £ Ni{z) — N{u), such that Nh{w) = Nh{u), then Nh{u) is a minimal 
separator. 

Proof: First note that w is nonadjacent to u. If PEO{w) > PEO{u), clearly 
since Nh{u)f^N{w) = Nh{w) = Nh{u), by Theorem 2, Nh{u) is a minimal 
separator and w is a witness for Nh{u). If PEO{w) < PEO{u), we just have to 
interchange the roles of u and w, and we again get Nh(u) = Nfi{w) is a minimal 
separator. □ 

The first and second type witnesses certainly do not exhaust the set of all 
possible witnesses. But it turns out that these two types are sufficient for our 
algorithm. This is because of the following lemma, which assures that, if first 
type witnesses are not available, then a second type witness is guaranteed to 
exist, provided the Nh{u) in question is a minimal separator. 
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Lemma 9. Let Nh{u) be a minimal separator. If Nh{z) — N{u) = 4>, then there 
exists a node w ^ u, such that w € Ni{z) — N{u), and Nh{w) = Nh{u). 

Proof: Since Nh{u) is a minimal separator, there exists a node v G V, 
such that Nfi{u) minimally separates u from v. (If Nh(u) is a minimal (vi — 
V 2 ) separator, at least one of them, either vi or V 2 , should be in a different 
connected component than that of u, when Nh(u) is removed from G. Then 
clearly Nh{u) minimally separates u from that node). Let A be the connected 
component of G{V — Nh{u)) which contains v. Clearly u ^ A. First note that 
by minimality of the separator Nh{u), Nh{u) = N{A). Let x = highest{A). We 
claim that PEO{x) < PEO{z). Otherwise, if PEo{x) > PEO{z), consider a 
chordless path P = (z,Wi, - ■ ■ ,x) in G(AlJ{z}), which is guaranteed to exist 
since G(A1J{2}) is connected. Then clearly rui G Nh{z), by corollary 2. Also 

^ N{u), since w\ G A and u is separated from A by Nh{u). Thus wi G 
Nh{z)—N{u), contradicting the assumption that Nh{z)—N{u) = 4>. We infer that 
PEO{x) < PEO{z). We conclude from lemma 4, that Nh{x) = N{A) = Nh{u). 
Also since x G Ni{z) and x ^ N{u), we have x G Ni{z) — N{u). The lemma 
follows. □ 

Finally, the reason why we consider these special kind of witnesses, namely 
the second type witnesses, is exactly that, they are easy to identify. The following 
lemma explains this. 

Lemma 10. Suppose that Nh{z) — N{u) = (f). A node w ^ u, w G Ni{z) — N{u) 
is a second type witness if and only if dh{w) = dh{u) and lowest{Nh{w)) = z. 

Proof : If ru is a second type witness, ie, if Nh{w) = Nh{u), clearly we 
have dh{u) = dh{w) and lowest{Nh{w)) = z. Now suppose that there is a node 
w G Ni{z) — N{u), with dh{u) = dh{w) and lowest{Nh{w)) = z. We first show 
that Nfi{w) C Nh{u). Let y G Nh{w). Noting that by the assumption 2 = 
lowest{Nh{w)), z G Nh{w) also, we get (z,y) G E, since Nh{w) should form a 
clique. But 2 = lowest{Nh{w)), therefore PEO{y) > PEO{z), ie y G Nh{z). 
Now if y ^ Nh{u), then y G Nh{z) — Nh{u) = Nh{z) — N{u), which contradicts 
the assumption that Nh{z)—N{u) = </>. Thus y G Nh{u). It follows that Nh{w) C 
Nh{u). But since dh{u) = dh{w), it has to be the case that Nh{u) = Nh{w). That 
is, w is a, second type witness of Nh{u). □ 

Algorithm To Enumerate Minimal Separators 

Step 1: Find a PEO of G. 

Step 2: (Preprocessing:) For each node uGV, find dh{u) and Zu=lowest{Nh{u)). 
Store these information with the node, for future use. 

Step 3: For each u G V with nonempty Nh{u), do: 

Step 3.1: (Looking for first type witness :) If Nh{zu) — N{u) yf </>, output 
Nh{u). 

Step 3.2: (Looking for second type witness :) If Nh{zu) — N{u) = <j), then if 
there is a node w ^ u and w G Ni{zu) — N{u), such that dh{u) = dh{w) and 
Zw = Zu, then output Nh{u). 

Theorem 3. The above algorithm outputs exactly the set of all minimal sepa- 
rators. 
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Proof: First note that the algorithm outputs Nh{u) only if a witness is found. 
Therefore it outputs only minimal separators. Now every minimal separator is 
Nh{u) for some u G V , by Theorem 2, and has either a first type witness or 
second type witness by lemma 9. Therefore every minimal separator will be 
output. □ 

Theorem 4. The above algorithm runs in 0{m + n) time. 

Proof: Note that implementing step 2 and 3 only involves traversing the list 
of neighbours of each node a constant number of times. Since i7„gy(i(u) = 2m, 
this can be done in 0(m) time. Step 1 can be implemented in 0{m + n) time. □ 

4 Upper Bound for Number of Minimal Separators 
of Chordal Graphs 

Theorem 5. Number of minimum separators < Number of minimal separators 
< n — k{G) — 1. 



Proof : Since every minimum separator is a minimal separator, we just have 
to show the second inequality. It is clear from Theorem 2 that each minimal 
separator is Nh{u) for some interior vertex u. But the number of interior vertices 
is < n — k{G) — 1, by lemma 6. Hence the result. □ 

Finally we show that the above inequality can be tight, by constructing a 
graph Gn{Vn,En), with k{G) = k, and n > /c + 2, for fc > 1, such that the 
number of minimum separators = n — k — 1. This clearly proves that for G„, the 
number of minimum separators = number of minimal separators = n — n{G) — 1. 
For this we just have to show that Vf, the set of interior nodes of G„, contain 
exactly n — k — 1 nodes, and for each u G Vf, dh{u) = k (see Theorem 1). 

First let I = k + 2. Let Vj = {vi,V 2 , ■ ■ ■ ,vi}. Define Gi{Vi,Ei) to be the graph 
obtained by removing the edge (vi,vi) from the complete graph on Vj. Consider 
an ordering fi of Gi defined by fi{vi) = i, for 1 < i < 1. We claim that G; is 
a chordal graph, and fi is a PEO. We leave the details to the reader. Also it is 
easy to verify that /t(G) = k and V{ = {fi}. Thus we have just one, minimum 
separator for G;. 

Now we inductively construct Gn{Vn, E^)- Let G„_i(U-i) be a chordal 

graph with PEO defined by /„_i(ui) = z, for 1 < i < n — 1. Let k(G„_i) = k and 
let Vf_i = {v\,V 2 , ■ ■ ■ ,Vn-k- 2 \- Also let for all u G Vf_i, dh{u) = k and Nh{u) 
distinct from one another. In other words, the number of minimum separators 
of G„_i is exactly {n — 1) — k — 1. 

Now let Vn = U-iU{^n}> where is a new node. Let fni^i) = i, for 
1 < z < n and = E^-i ^ G U-i with fn{vi) = i>n-k-l}. 

We claim that G„ is a chordal graph with a PEO defined by /„. Also it is not 
difficult to show that k(G„) = k. We leave the details to the reader. 

Clearly Vf = U-i U{^n-fe-i}- Note that dh{vn-k-i) = k. Thus we have 
\Vf \ = n-k-l, andVz; G Vf,dh{v) = k{G) and Nh{vn-k-i) ^ Nh{ufiu G Vf_^. 
Thus Gn has exactly n — k(G) — 1 minimum separators by Theorem 1. 
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Abstract. Graph separation is a well-known tool to make (hard) graph 
problems accessible for a divide and conquer approach. We show how 
to use graph separator theorems in order to develop fixed parameter 
algorithms for many well-known NP-hard (planar) graph problems. We 
coin the key notion of glueable select&verify graph problems and derive 
from that a prospective way to easily check whether a planar graph 
problem will allow for a fixed parameter divide and conquer algorithm 
of running time c'^ ■ for a constant c. 



1 Introduction 

Algorithm designers are often faced with problems which, when viewed from clas- 
sical computational complexity theory, are “intractable.” More formally speak- 
ing, these problems can be shown to be AP-hard. In many applications, however, 
a certain part (called the parameter) of the whole problem can be identified which 
tends to be of small size k when compared with the size n of the whole problem 
instance. This leads to the study of parameterized complexity [7]. 

Fixed parameter tractability. Formally, one terms a (parameterized) prob- 
lem fixed parameter tractable if it allows for a solving algorithm running in time 
f{k)nP^^^ on input instance (/, k), where n = |/| and / is an arbitrary function 
only depending on k. We will also term such algorithms “/(/c)-algorithms” for 
brevity, focusing on the exponential part of the running time bound. The as- 
sociated complexity class is called FPT. Of course, designing fixed parameter 
algorithms with a “small” function / is desirable. To our knowledge, so far, only 
one non-trivial fixed parameter tractability result where the corresponding func- 
tion / is sublinear in the exponent, namely f{k) = c'^ [1], is known: planar 
DOMINATING SET. Similar results hold for closely related problems such as face 
COVER, PLANAR INDEPENDENT DOMINATING SET, PLANAR WEIGHTED DOMI- 
NATING SET, etc. [1]. In the companion paper [2], we proved similar results for 
a much broader class of planar graph problems, presenting a general methodol- 
ogy based on concepts such as tree decompositions and bounded outerplanarity. 
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** Supported by the Deutsche Forschungsgemeinschaft (research project PEAL (Pa- 
rameterized complexity and Exact ALgorithms), NI 369/1-1). 



J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 318-327, 2001. 
© Springer- Verlag Berlin Heidelberg 2001 




Graph Separators: A Parameterized View 



319 



Here, we will also discuss a rather general approach for obtaining parameterized 
graph algorithms running in time^ • q{n)) for sublinear functions e, i.e., 

e{k) G o{k). More precisely, we investigate the use of (planar) separator theo- 
rems in this context, yielding an alternative and conceptually rather different 
framework in comparison with [2]. 

General outline. It is a crucial goal throughout the paper not to narrowly 
stick to problem-specific approaches, but to try to widen the techniques as far 
as possible. More specifically, we show how to use separator theorems for differ- 
ent graph classes, such as, e.g., the well-known planar separator theorem due to 
Lipton and Tarjan [8], in combination with known algorithms for obtaining (lin- 
ear size) problem kernels in order to obtain fixed parameter divide and conquer 
algorithms. Special care is taken for the dependency of the running time on the 
“graph separator parameters” (and how they influence the recurrences in the 
running time analysis). We consider a broad class of problems that can be at- 
tacked by this approach, namely, in principle, all so-called glueable select&verify 
problems such as, e.g., planar dominating SET.^ Also, we exhibit the influ- 
ence on the running time analysis of so-called cycle separators. Although the 
constants achieved in our setting so far seem to be too large in order to yield 
practical algorithms, our approach provides a general, sound, mathematical for- 
malization of a rich class of problems that allow for divide and conquer fixed 
parameter algorithms. Our methodology seems to leave much room for improve- 
ment in many directions. For instance, we introduce the novel concept of problem 
cores that can replace problem kernels in our setting. Finally, we give a push to 
the study of subclasses of the parameterized complexity class FPT. In this sense, 
our work also might serve as a starting point for more algorithmic (also concern- 
ing graph theory with respect to separator theorems), as well as more structural 
complexity-theoretic lines of future research in parameterized complexity. 

Due to the lack of space, several details are deferred to the long version. 

2 Basic Definitions and Preliminaries 

We consider undirected graphs G = (V,E), V denoting the vertex set and E 
denoting the edge set. We only consider simple graphs (i.e., with no double edges) 
without self-loops. Sometimes, we refer to M by M (G) in order to emphasize that 
V is the vertex set of graph G; by N{v) we refer to the set of vertices adjacent 
to V. G[D] denotes the subgraph induced by a vertex set D. For graphs Gi = 
(Vi,Ei) and G2 = (V2, E 2 ), by Gi nG2, we denote the graph {V\ fl V 2 , E\ fl S2). 
G' = {V',E') is a subgraph of G = (V,E), denoted by G' C G, if W C ]/ and 
E' C E. In this paper, we only consider graph classes, denoted by G, that are 
closed under taking subgraphs. The most important among these graph classes 
is that of planar graphs, i.e., graphs that have a drawing in the plane without 

^ Here and in the following, n is the number of vertices of the input graph and k is 
the parameter of the considered graph problem. 

^ Lipton and Tarjan [9] only describe a solution for the structurally much simpler 
planar independent set in detail. 
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edge crossings. Among others, we study the following vertex subsets of a graph 
G = (V,E): A vertex cover C C V satisfies that every edge of G has at least 
one endpoint in G. An independent set is a set of pairwise nonadjacent vertices. 
A dominating set D CV obeys that each of the rest of the vertices in G has at 
least one neighbor in D. The corresponding problems are denoted by (planar) 
VERTEX COVER, INDEPENDENT SET, and DOMINATING SET. 

Further conventions: we write A+B to denote the disjoint union of sets A and B. 
We let 0 • (±oo) = 0. 

Linear problem kernels. Let £ be a parameterized problem, i.e., £ is a subset 
of E* X Reduction to problem kernel, then, means to replace instance (/, k) € 
E* X N by a “reduced” instance (/', k') G E* x N (which we call problem kernel) 
such that k' < c - k, |/'| < p{k) with constant c, some function p only depending 
on k, and (7, fc) G £ iff (/', k') G £. Furthermore, we require that the reduction 
from (I,k) to {!' ,k') is computable in polynomial time TR-(|/|,fc). 

Often, the best one can hope for is that the problem kernel is size linear in k, 
a so-called linear problem kernel. For instance, using a theorem of Nemhauser 
and Trotter, Chen et al. [4] observed a problem kernel of size 2k for vertex 
COVER on general (not necessarily planar) graphs. Furthermore, planar in- 
dependent SET has a problem kernel of size 4fc due to the four color theorem. 
Once having a linear size problem kernel, it is fairly easy to use our framework to 
get c'^-algorithms for these problems based upon the famous planar separator 
theorem [8,9]. The constant factor in the problem kernel size directly influences 
the value of the exponential base. Hence, lowering the kernel size is a crucial 
goal. 

Classical separator theorems. Let G = (V,E) be an undirected graph. A 
separator S' C C of G partitions V into two parts Ai C V and A 2 C V such 
that A 1 + S + A 2 = V, and no edge joins vertices in Ai and A 2 ; (Ai,S, A 2 ) is 
called a separation of G. When we restrict our attention to planar graphs, S is 
called cycle separator if it forms a cycle in some triangulation of G. According to 
Lipton and Tarjan [8], an /{■)- separator theorem (with constants a < 1, (3 > Q) 
for a class G of graphs which is closed under taking subgraphs is a theorem of 
the following form: If G is any n-vertex graph in G, then there is a separation 
{Al, S, A 2 ) of G such that neither A\ nor A 2 contains more than an vertices, 
and S contains no more than /3/(n) vertices. 

Stated in this framework, the planar separator theorem due to Lipton and 
Tarjan [8] is a -y/i-separator theorem with constants a = 2/3 and j3 = 2\/2. The 
current record for a = 2/3 is /3 « 1.97 [6]. Similar \£_separator theorems are 
also known for other graph classes, e.g., for the class of graphs of bounded genus, 
see [5]. It is also possible to incorporate weights in most separator theorems. We 
refer to them as /(-)-separator theorems for weighted graphs. 



® In this paper, we assume the parameter to be a positive integer although, in general, 
it might also be from an arbitrary language (e.g., being a subgraph). 
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3 Glueable Graph Problems 



Select&verify graph problems. A set Q of tuples {G,k), G an undirected 
graph with vertex set V = {wi,...,w„} and k a nonnegative real number, is 
called a select&verify (graph) problem if there exists a pair (P. ,opt) with opt € 
{min, max}, such that P. is a function that assigns to G a polynomial time 
computable function of the form Pg = PJ®* + P™*', where PJ®^ : (0, 1}" —>■ K+, 
P^®" : (0, 1}” ^ (0, ±oo|, and 



(G,k)eg 



( optg.g^o,!}’* Pg{x) < k if opt = min, 

} optg^g^Q Pg{x) > k if opt = max . 



For X = {xi, . . . , Xn) G {0, 1}" with Pg{x) < fc if opt = min and with Pg{x) > k 
if opt = max, the vertex set selected by x and verified by Pq is {vi & V \ Xi = 
n}. A vector x is called admissible if Pq^’^(x) = 0. 

The intuition behind the term P. = psei_|_pver jg “selecting function” 

P®®^ counts the size of the selected set of vertices and the “verifying function” 
pv®r verifies whether this choice of vertices is an admissible solution. Every 
select&verify graph problem that additionally admits a problem kernel of size 
p{k) is solvable in time 0{2P^^^p{k) + TK{n,k)). 

We now give two examples for select&verify problems by specifying the func- 
tion Pg = P(|®* -I- Pg®’’. In both cases the “selecting function” for a graph 
G = {V,E) will be PJ®i = Xi- Firstly, in the case of Vertex cover, 

we have opt = min and choose Pq^’^(x) = vj}eE oo • (1 — Xj)(l — Xj). Thus, 
Pg{x) < k guarantees a size at most k vertex cover set. Secondly, for dominat- 
ing SET, we have P^“(a;) = ' ]!{«,, “ ^j))- 

We will also need a notion of select&verify problems where the “selecting 
function” and the “verifying function” operate on a subgraph of the given graph: 
Let P. = P®®* -I- P''®’’ be the function of a select&verify problem. For an n- vertex 
graph G and subgraphs G™" = (V™", G®®‘ = (V®®\ P®®^) C G, we let 



Pcver(a; I G®®^) := P^“er(7Tvver(a;)) + P^®{,(^V-(a=)), 

where Try/ is the projection of the vector x G (0, 1}” to the variables correspond- 
ing to the vertices in V' C V. 

Glueability. We are going to solve graph problems, slicing the given graph 
into small pieces with the help of small separators. The separators will serve 
as boundaries between the different graph parts into which the graph is split. 
For each possible assignment of the vertices in the separators, we want to - 
independently - solve the corresponding problems on the graph parts and then 
reconstruct a solution for the whole graph by “gluing” together the solutions for 
the graph parts. We need to assign colors to the separator vertices in the course 
of the algorithm. Hence, our algorithm has to be designed in such a manner 
that it can also cope with colored graphs. In general (e.g., in the case of the 
DOMINATING SET problem), it is not sufficient to simply use the two colors 1 
(for encoding “in the selected set”) and 0 (for “not in the selected set”). 
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Let us introduce some auxiliary notions. Let G = (V, E) be an undirected 
graph and let Cq,Ci be finite, disjoint sets. A Co-Ci-coloring of G is a function 
X : y — >■ Go + Gi + {#}.■* For y' C y, a function x : y' — >• Go + Gi can naturally 
be extended to a Go-Gi-coloring of G by setting x(w) = # for all u G y \ y'. 

Consider a vector x G {0, Ill'll. Let x be a Go- Gi -coloring of G. Then, 
X is consistent with x? written a: ~ x? for * = 0 , 1 and j = l,...,|y|, 
^ ^ ~ L ■ 

If X is a Go-Gi-coloring of G and if x' is a Gg-G^-coloring of G, then x is 
preserved by x^ written x X^ if Vu G y Vi G {0, 1} (x(v) & Ci ^ x'W) G C''). 

In the next section, when doing the divide and conquer approach with a given 
separator, we will deal with colorings on two different color sets: one color set 
Gint ._ internal coloTS that will be used for the assignments 

of colors to the separator vertices and a color set G®’^* := Gg’^* -I- Gf’^* + {#} 
of external colors that will be used for handing down the information in the 
divide-step of the algorithm. The idea is that, in each recursive step, we will be 
confronted with a graph “pre-colored” with external colors. Every function © 
that assigns to a pair (x®^‘, x“‘) with x'”'* : V G®^‘, x“‘ : y ^ G“‘, x®’'* 
X™*, a (Gg’^*-Gf’^*)-coloring x^’^^Sx'”* is called a recoloring if x™* X®’'*®x“*- 

From the point of view of recursion, x®’^* is the pre-coloring which a certain 
recursion instance “receives” from the calling instance and x™* represents a 
coloring which this instance assigns to a certain part of the graph. The coloring 
X 6 xt 0 ^mt jg down in the recursion. 

We now introduce the central notion of “glueable” select & verify problems. 
This formalizes those problems that can be solved with separator based divide 
and conquer techniques as described above. 

Definition 1. A select&verify problem Q given by (P. ,opt) is glueable with a 
colors if there exist 

• a color setC^'^* := Gg™‘+G™*+{#} of internal colors with |Gg®”*+G“*| = cr; 

• a color set G®^* := G“* + + {#} of external colors; 

• a polynomial time computable function h : (K.+ U {±oo})^ — >■ R+ U {±oo}; 

and, for every n-vertex graph G = {V, E) and subgraphs G”®”, G*®^ C G with a 
separation {Ai, S, A 2 ) ofG^’^^, we find 

• recolorings ©x for each X G {Ai, S, A 2 }, and 

• for each internal coloring x“* : S — >■ G®”*, subgraphs G'^f{x™*) of G'"^'^ with 
G”®”[A,] C Gjf”(x®”‘) C G®'®”[A, + 5] for i = 1,2, and subgraphs G^®”(x®"‘) 
of G”®” with g|®”(x®"‘) C G”®”[S'] 

such that, for each external coloring x^^ '■ V G^^*, 

opt{PG^x I G*®') I a: G {0, 1}" A a: ~ X^"‘} (1) 

= opt,,int,s^c“‘ + c“‘ h{EvalAiix'''^*), Evalsix"""*), EvaUAx"'^*)) ■ 

^ 17% t 

The symbol # will be used for the undefined (i.e., not yet defined) color. 



4 
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Here, Evalx{-) for X G {Ai,S', A 2 } is of the form Evalxix^"'*) = 

opt{PGr(x-)(® I n I G {0, 1}" A a; ~ (x^"‘ ©x x“‘)}- 

Lemma 1. Vertex cover and independent set are glueable with 2 colors 
and DOMINATING SET is glueable with 4 colors. 

Proof. For Vertex cover, we use the color sets Cf := {i^} for t G {int,ext} 
and t = 0, 1. The function h is h{x, y, z) = x+y+z. The subgraphs G™''(x‘"*) for 
V G {Ai,S,A 2 } and : S Go“ + Gi“* are G^"'(x“‘) := G™"'[V], In this 
way, the subroutine Evals(x“*) checks whether the coloring x™* yields a vertex 
cover on G™“'[5'] and the subroutines Evalxi(x'”*) compute the minimum size 
vertex cover on G™“'[Ai]. However, we still need to make sure that all edges going 
from Ai to S are covered. If a vertex in S is assigned a I™‘ by x“*) the incident 
edges are already covered. In the case of a 0 “^-assignment for a vertex v G S, 
we can color all neighbors in N{v) fl Ai to belong to the vertex cover. This is 
done by the following recolorings “©^^.” Define 

( 0®’^* if x“(^^) = 0“*, 

©Ai X™*)(^) = S h x‘”*(^) = 1™* or 3w G N{v) with x‘”*(^c) =0“*, 

[ #, otherwise. 

By this recoloring definition, an edge between a separator vertex and a vertex 
in Ai which is not covered by the separator vertex (due to the currently consid- 
ered internal coloring) will be covered by the vertex in Ai. Our above reasoning 
shows that - with these settings - equation (1) is satisfied. 

Independent set is shown to be glueable with 2 colors by a similar idea. 
Regarding dominating set, we refer to the full paper. □ 

We want to mention in passing that - besides the problems given in Lemma 1 
- many more select&verify problems are glueable. In particular, this is true for 
the weighted versions and variations of the above mentioned problems. 

4 Fixed Parameter Divide and Conquer Algorithms 

For the considerations in this sections, let us fix a graph class G for which 
a v^-separator theorem with constants a and l3 is known. Then, we consider a 
select&verify graph problem Q defined by (P., opt) that is glueable with cr colors. 

4.1 Using Glueability for Divide and Conquer 

The evaluation of the term opt^jg^g Pg{x) can be done recursively as follows. 
Start the computation with opt{PG(^r) I x G {0,1}”} = optjPc^d® I G®®') | 
a; G {0, 1}”, a: - x8’'*}. where “x^* = #” and G™^ = G®®^ = G. 

When opt,jg{g I G®®^) needs to be calculated for some 

G®®i,Gver C G, and an external coloring x®’'* : V{G) -G G^’^* + Gf * +{#}, 
we do the following: 
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1. If G'™'' has size greater than some constant c, then find a -y^-separator 
for G™" with y(G™*') = A 1 + S + A 2 . 

2. For all internal colorings x'”* of S with x®’^* X'”* do: 

(a) Determine EvalAj(x'”*) recursively for f = 1, 2. 

(b) Determine Evals(x'”*)- 

3. Return opt^i„t_^ext^^mt h(EvaUj(x™*),Evals(x™‘),EvaU 2 (x™*))- 

The size of the subproblems, i.e., the size of the graphs G™''(x“*) which are 
used in the recursion, plays a crucial role in the analysis of the running time of 
this algorithm. 

Definition 2. A glueable select&verify problem is called slim if the subgraphs 
G™^(x™*) are only by a constant number of vertices larger than G'"‘^'^[Ai], i.e., if 
there exists arj > 0 such that |E(G^®’'(x“*))l ^ 1^*1 for all internal colorings 

.^int . ^ y ^ int 

Note that the proof of Lemma 1 shows that both vertex cover and indepen- 
dent SET are slim with 77 = 0, whereas dominating set is not, as exhibited in 
the paper’s long version. The following proposition gives the running time of the 
above algorithm. In order to assess the time required for the above given divide 
and conquer algorithm, we use the following abbreviations for the running times 
of certain subroutines: Ts{n) denotes the time to find a separator in an n- vertex 
graph from class G. TM{n) denotes the time to construct the modified graphs 
G™‘'(x™*) G G and the modified colorings (x®’'* ®x x'”*) (for X = {Ai,S,A 2 } 
and each internal coloring x“* with x*’’*^* X™*) from an n- vertex graph from 

class G. Te(to) is the time to evaluate Evals(x“*) for all x™*? X™*? iii ^ 

separator of size m = P^/ri. Tg(ji) is the time for gluing the results obtained by 
two sub-problems each of size 0{n). In the following, we assume that all these 
functions are polynomials. 

Proposition 1. For every G G G, opt^^g^p Pg{x) can be computed in time 
c{a',P,a)'^q{n), where c{a' , 

Here, a' = a+e for any e G (0, 1— a) and the running time analysis only holds for 
n > 770 (e), and q is some polynomial. If, however, Q is slim or the yf--separator 
theorem yields cycle separators, then the running time for the computation is 
c{a, P,a)^q{n), which then holds for all n. 

Proof. (Sketch) Let T{n) denote the time to compute opt^^g^o,!}", (® I 

Gsei) £qj- graph G™“' = E™’') with n = |E™’'| (where • V{G) —>■ 

Gq’^* -I- G®’^* -|-{#} is some external coloring and G®®',G™'' C G). In the case of 
the existence of a cycle separator theorem or if the problem is slim, the recur- 
rence we have to solve in order to compute an upper bound on T{n) then reads 
as follows: 

T{n) < ■ 2T{an) ■ {TM{n) + TEifdy/n) + Tg{an + (3\/n)) +Ts{n). 

-V 

= 'TM,E,g{n) 
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The functions TM,E,g(ji) and Ts{n) are polynomials. The solution is given by 
T(n) < for some polynomial q{-). In the general case, from 

the definition of glueability, we have that the size of the two remaining subprob- 
lems to be solved recursively is < an + (3^/n for each y'"* that 

preserves Since with a' = a -|- e, for some e € (0, 1 — a), the inequality 
an + [3y/n < a'n holds for sufficiently large n, the result follows. □ 

4.2 How (Linear) Problem Kernels Help 

Proposition 1 together with the existence of problem kernels yields: 

Theorem 1. Suppose that Q admits a problem kernel of polynomial size p{k) 
on G computable in time Tx{n, k). Then, there is an algorithm to decide (G, k) G 
Q, for a graph G G G, in time 

c{a' , (3, + Tx{n, k), where c{a' , P, a) = (2) 

and of = a + t for any e G (0, 1 — a), holding only for n > no(e), where q{-) is 
some polynomial. If, however, Q is slim or the yf--separator theorem yields cycle 
separators, then the running time for the computation is c{a, q{k) + 

Tk{u, k), which then holds for all k. □ 

In particular, this means that for glueable select&verify problems for planar 
graphs that admit a linear problem kernel of size dk, we get an algorithm of run- 
ning time 0 (c(q;', /3, a, d)'^q{k)+TK{n, k)), where c(a', P, a, d) = 

Since vertex cover is a slim problem, Theorem 1 yields a c'^^-algorithm 
for Gg, where Gg denotes the class of graphs of genus bounded by g, see [5]. 

4.3 Towards Avoiding (linear) Problem Kernels: The Core Concept 

We are going to introduce the novel notion of problem cores, which is closely 
related to that of problem kernels, but seemingly “incomparable” and tailored 
towards unweighted minimization select&verify problems. The idea is to restrict 
(only) the size of the “selection space”, while - unlike in the setting of prob- 
lem kernels - the whole (possibly large) problem instance may be still used for 
“checking” . 

Definition 3. Consider an unweighted select&verify minimization graph prob- 
lem G specified by (P. , min). A corer of size p{k) is a polynomial time computable 
mapping p : {{V,E),k) W satisfying \Vc\ < p{k), and 3® = {xi, . . . ,x\v\) G 
{0, Ill'll (P g(3^) C: k a {vi G P I = 1} C Vc). The set Vc is also called the 
problem core ofG- If p{k) = ak, we call p a linear corer. In this case Vc is called 
a factor-a problem core. 

Note that weighted minimization problems could also be treated similarly at the 
expense of further technical complications. Having a problem core automatically 
makes a select&verify problem a “simple” one: For the problem core Vc, which 
can be computed in polynomial time, it is enough to check all fc-element subsets, 
giving fixed parameter tractability. Stirling’s formula yields: 
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Lemma 2. If a select&verify problem Q has a size ak problem kernel or if the 
core is a factor-a core, then there is a “mm{(ea)^ ,2'^'^}-algorithm” for Q . □ 

Even though there seems to be no general interrelation between problem kernels 
and cores, for our purposes, the different concepts can be interchanged: 

Theorem 2. Let G be an n-vertex- graph from a graph class G for which a yf- 
separator theorem for weighted graphs is known with constants a, j3. Suppose 
that G admits a corer of size p{k), which can be computed in polynomial time 
Tc{n). Then, there is an algorithm to decide (G,k) G G, for G € G, in time 

c{a' , P,a)'^^^^q{k) + Tc{n), where c{a' , fi,a) = 

and a' = q: + e for any e G (0, 1 — a), holding only for n > no(e). If, however, 
G is slim or the -separator theorem yields cycle separators, then the time for 
the computation is c{a, P,a)'^^^^q{k) + Tc{n), which holds for all k. 

Proof. (Sketch) Consider G = (V,E) G G. The algorithm proceeds as follows: 

(1) Compute a core Vc QV containing at most p{k) vertices in time Tc{n). 

(2) Then, G[Vc] G G is the graph from which vertices have to be selected. 

(3) Find an optimal x satisfying Pg{x \ G[Ec]) by applying the algorithm out- 

lined before Prop. 1. One modification of this algorithm, however, is necessary: 
we do not use a separator theorem in step 1 of that algorithm for a separation 
of G™*', but we add weights to G™“' as follows: If induces a weight function oj on 
G™*' by letting uj(v) = l/|K|j if r' G V), and uj(v) = 0, otherwise. Then, we apply 
a separator theorem for weighted graphs to with the weight function cu. 
The constants c{a,f3,a) and c{a' ,j3,a) then are derived as in Prop. 1. □ 



5 Conclusion and Further Results 

Summary. We exhibited how to use separator theorems for obtaining c'^-fixed 
parameter divide and conquer algorithms. We defined “glueable select&verify 
problems,” capturing graph problems such as vertex cover and dominating 
SET, as a problem class that allows for a divide and conquer approach on certain 
graph classes. Admittedly, the constants within these algorithms are still rather 
huge. For example, in the case of planar vertex cover, Theorem 1 yields a 
3718i\/fe-aigorithm. In the long version, we elaborate on several ideas to overcome 
this weakness: 

Further results (see the full version of the paper). Firstly, we analyze how 
Lipton and Tarjan proved their famous planar separator theorem; basically, the 
proof consists of two steps: in the first step, the given graph is thinned into pieces 
of “small” radius, and in the second step, a special (cycle) separator lemma 
for planar graphs with bounded radius is used. The such obtained separator 
therefore consists of two parts. Since after one application of Lipton and Tarjan’s 
separator theorem (in a divide and conquer algorithm as described in Section 4), 
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the remaining graph pieces still have relatively small radius, one could avoid the 
first thinning step. Iterating this idea in an optimized fashion, one gets, e.g., 
in the case of planar vertex cover, a 8564'^-algorithm. It is a challenge if 
better algorithm bounds are obtainable by using other separator theorems. 

Secondly, we discuss the idea of stopping the recursion before having graph 
parts of constant size and then applying, e.g., elaborated search tree algorithms 
to these small parts. More precisely, a divide and conquer algorithm would once 
use Lipton and Tarjan’s planar separator theorem for slicing the graph into 
pieces of small radius and then use only the mentioned special (cycle) separator 
lemma in the remainder of the recursion, until all graph pieces are sufficiently 
small. In this way, one gets ^ -algorithms with reasonable small constants c. 
For example, we derive a 7.7670^^^^-algorithm for planar vertex cover. 
Future research. We briefly sketch three lines of future research. (1) An al- 
ternative idea in order to lower the involved constants would be to devise new 
separator theorems with constants a and /?, not only concentrating on bringing 
down f3 for fixed a (as, e.g., done for a = 2/3 in [6]), but on minimizing the 
function /3/(l — \/a). (2) It is an issue of future research to further investigate 
the newly introduced concept of cores. For example: is there a linear size core 
(or kernel) for planar dominating set? (3) Finally, this paper (together with 
[1,2]) might be a stimulus to investigate the structure “within” FPT in order to 
distinguish problems allowing for c'^-algorithms from problems which seem to 
allow only for c^-algorithms. Is it possible to develop a reasonable fine-grained 
structural theory of FPT? Note that Cai and Juedes [3] very recently showed 
that for a list of parameterized problems (e.g., for vertex cover on general 
graphs) c°^^^-algorithms are impossible unless FPT = W[l]. 
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Abstract. For a graph G on n vertices, with positive integer weights 
wi, . . . ,Wn assigned to the n vertices such that, for every clique K of 
G, ^ ^ < 1, the problem we are interested in is to assign binary 

ieK 

codes Gi, . . . , C„ to the vertices such that Ci has Wi (or a function of 
Wi) bits in it and, for every edge {i,j}, Ci and Cj are not prefixes of 
each other. We call this the Graph Prefix Free Code Assignment Problem. 
We relate this new problem to the problem of designing adversaries for 
comparison based sorting algorithms. We show that the decision version 
of this problem is as hard as graph colouring and then present results on 
the existence of these codes for prefect graphs and its subclasses. 



1 Introduction 

Our study of the Graph Prefix Code Assignment problem is motivated by the 
question of designing adversaries for sorting under partial information (discussed 
below). This motivation was suggested by Subrahmanyam and Dubhashi [1]. 

The problem of sorting under partial information is: Given a partial order 
P on a set X = {xi, . . . , Xn}, determine a hidden linear extension of P using 
questions of the form is Xi < Xj ?. An algorithm to solve this problem is said to 
sort P by comparisons. It is also referred to as a comparison based algorithm 
for sorting. In this paper we only consider comparison based sorting algorithms. 

A comparison based sorting algorithm can be seen as a game between two 
players D and H. In this game D has a partial order P and PI holds a linear 
extension of P, T, hidden from D. D tries to discover T by asking questions 
about the relation in T (comparisons) between pairs of elements. D has to make 
log e(P) comparisons to discover T where, e(P) is the number of linear exten- 
sions of P. An interesting question here is the issue of designing an adversary 
who answers on-line, the comparison queries of an algorithm, so as to force the 
algorithm to use log e(P) comparisons. This problem was solved by Brodal et 
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al. in [7] for the case when P is the anti-chain on X. They design a very simple 
and elegant adversary that answers the comparison queries of any sorting algo- 
rithm to force it to use f2(logn!) comparisons. For the general case when there 
is no restriction on P, such an adversary was designed by Kim and Kahn in [2] 
using a relation they prove between e(P) and the entropy of the comparability 
graph of P(defined below). The adversary answers each comparison query de- 
pending on the outcome of an entropy calculation (depending on P and answers 
to past queries). A question that remains open in this area, posed in [1], is to 
design an adversary for sorting under partial information that avoids the entropy 
calculations that are an integral part of the adversary designed in [2] . 

In this paper we present the relation between the design of adversaries for 
sorting under partial information and the Graph Prefix free Codes assignment 
problem. We present the first results on the prefix free codes problem. Our results 
are stated after the preliminaries in section 2. 

2 Preliminaries and Our Results 

A collection of positive integers wi, . . . ,Wr is said to satisfy Kraft ’s Inequality if 
V’' — < 1 

Throughout the paper, G is an undirected graph with positive integer weights 
wi, . . . ,w„ assigned to its vertices vi, ... ,Vn- The weights satisfy the property 
that for every clique in G, the weights assigned to its vertices satisfy Kraft’s 
inequality. The weights assigned to the vertices of a graph G are said to sat- 
isfy Kraft’s inequality if the induced weight assignment to every clique in G 
satisfies Kraft’s inequality. Unless mentioned otherwise the weight assignment 
(wi, . . . , w„) to vertices (vi, ... , v„) implies Vi is assigned the weight Wi. A simi- 
lar convention holds for the code assignment (Gi, ... , Gn). Two codes are prefix 
free if neither is a prefix of the other. 



Prefix Free Code Assignment Problem for Graphs: For a given positive 
integer c, a graph G with weights Wi, . . . , Wn assigned to vertices Vi , . . . , 
satisfying Kraft’s inequality, does there exist an assignment of binary codes 
Cl , . . . , C„ to the vertices wi , . . . , such that number of bits in Gi is less than 
or equal to Wi + c, and for every edge {i,j} in G, and such that Gi and Gj are 
not prefixes of each other? 

We call such a code a +c prefix free code for {G,{wi, . . . ,Wn)). We use 
(G, (wi, . . . , Wn)) to denote G with the weights {wi, . . . , Wn) assigned to its ver- 
tices. 



Prefix Free Codes and Tree Embeddings: Here we present the relation 
between prefix free codes and a partial order. Let T denote the infinite binary 
tree. With each node of T, associate a binary string in a natural way: the root 
is associated with the empty string, and recursively, if a node is associated with 
a string w, its children are associated with the sequences wO and wl. T can 
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be seen as a partial order with the elements related by the ancestor-descendant 
relationship. Consequently, it follows that two codes are prefix free if and only 
if they form an anti-chain in T . Therefore, our question boils down to assigning 
vertex Vi,i G {1, . . . , n}, to a node in the tree T at level Wi + cor lesser such that 
the nodes assigned to adjacent vertices in the graph form an anti-chain in T. 
This problem is now a natural variant of the Graph Vertex Colouring problem. In 
the Graph Colouring problem, the colours or codes to be assigned to the vertices 
come from the partial order in which any two distinct elements are unrelated. 
Therefore, two adjacent vertices must be given colours that are unrelated in this 
partial order. On the other hand, in the Prefix Free code assignment problem, we 
are given a tree partial order, and weights to the vertices. The code assignment 
criterion is that two adjacent vertices get unrelated codes in the partial order 
and, there is a restriction on the layers where the codes come from. 

Definitions Related to Partial Orders and Graph Entropy: The com- 
parability graph, G{P), of a partial order P on a set X = {x\, . . . ,x„}, is an 
undirected graph with X as its vertex set, and {xi,Xj} G E{G{P)) iff Xi and Xj 
are comparable in P. 

We write P{x < y) to denote the partial order obtained from P by adding 
the relation x < y. We refer only to a partial order without referring to the set 
on which it is defined when there is no confusion about the underlying set. 

The vertex packing polytope, VP{G), of a graph G with P as a vertex set, is 
the convex hull of the set of indicator vectors of stable (independent) sets of V. 
Recall that the indicator vector, I a, of a set ^ C V, is a |P|-dimensional vector 
in which (lA)i = 1 if the i-th vertex of V is in A. Otherwise, it is zero. 

The graph entropy of G with respect to a probability distribution ponV (G) 
is 

n=\V\ 

H{G,p)= min - 'V pAogai 
aeVP(G) ^ 

1—1 

In this paper we are interested only in the case when p is the uniform distribution 
on the set of vertices and therfore, entropy is denoted by H{G). The entropy of 
the comparability graph of a partial order P is denoted by H{P). We will use 
the phrase entropy of a partial order P to mean the entropy of the comparability 
graph of P. 

2.1 Our Results 

We first present the relation between the prefix free codes problem and the 
problem of designing adversaries for sorting under partial information. Using 
the similarity of the prefix free code assignment problem to graph colouring, we 
show that 

— For an input (G, {wi , . . . , rc„)) where wi, . . . , Wn satisfy the Kraft’s inequal- 
ity, there is no polynomial time algorithm to decide if there is a -|-0-prefix 
free code unless P = NP. 
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We address the problem of the existence of these codes and prove that 

— For every positive integer c, there is a graph G, and a vertex weight assign- 
ment {wi, . . . , Wn) such that there is no -|-c-prefix free code. 

We then observe that the graphs we construct in the above proof are imperfect 
graphs, that is, graphs for which there is an induced subgraph whose clique 
number and chromatic number are different. So we concentrate our exploration 
on perfect graphs. This direction is also consistent with our goal of designing 
adversaries for sorting where, the graphs that arise are comparability graphs 
(this class of graphs are known to be perfect). 

~ We first construct a perfect graph along with vertex weights for which there 
is no -|-0-prefix free code. We then show that for any perfect graph with 
weights {w\, . . . , Wn) satisfying Kraft’s inequality, there is a prefix free code 
assignment whose code lengths are within a multiplicative factor of 2 from 
the specified weights. 

We then investigate the problem for specific classes of perfect graphs namely 
chordal graphs. Specifically, 

— we construct a chordal graph G and associated vertex weights w\, . . . ,Wn 
such that (G, {w\, . . . , Wn)) does not have a -|- 1-prefix free code. 



Plan of the Paper. In Section 3 we present the relation between designing 
adversaries for sorting under partial information and the prefix free code as- 
signment problem. In Section 4 we present the hardness result for the prefix 
free code assignment problem. In Section 4.1 we prove non existence results for 
-|-c-prefix free codes for general graphs. In Section 4.2 we study the problem in 
relation to the perfect graphs. Section 5 presents our results on chordal graphs. 

3 Motivation for the Prefix Free 
Code Assignment Problem 

In this section we present the connection to the problem of designing adversaries 
for sorting under partial information. We first present the adversaries due to 
Brodal [7] and Kim and Kahn [2]. We then point out a method for designing an 
adversary, which leads to the prefix free code assignment problem. To make the 
distinction between the two adversaries, we refer to the adversary of Brodal as 
the adversary for sorting and the adversary of Kim and Kahn as the adversary 
for sorting under partial information. 

3.1 Adversary for Sorting 

The adversary that we present is one that forces any sorting algorithm to use 
l7(logn!) comparisons. 
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The Adversary. We describe the adversary of [7] for answering comparisons 
made by a sorting algorithm. At the outset (before any comparisons are made), 
the adversary places all the elements of X at the root of an infinite binary tree. 
For X G X, let v{x) denote the node of the tree to which x has been assigned. 
The adversary answers a comparison between elements x,y G X based on the 
follows cases: 

— v{x) = v{y)\ Then x is moved to the left child of v{x) and y to the right 
child and the adversary answers x < y. 

~ v(x) is a descendant of v(y): y is moved to the unique child of v{y) that 
is not an ancestor of v{x). If this child is the left child then the adversary 
answers y < x and if it is a right child it answers x < y. 

— v{x) and v{y) are unrelated: If v{x) is visited before v{y) in a pre-order 
traversal of the tree, the adversary answers x < y and otherwise the adver- 
sary answers y < x. 

It is proved in [7] that this adversary forces any sorting algorithm to use at 
least comparisons. The key ingredients in the proof of this claim are the 

following: 

— In the worst case, each comparison pushes two elements down by one level 
in the tree. 

— At the end of the algorithm, each node of the tree will contain at most one 
element. 

— Sum of the distances from the root to nodes that contain an element of X is 
at least n log n and this is equal to sum of the number of levels each element 
has been pushed down. This statement is the most important ingredient in 
the proof of the claim. 

— Each comparison contributes at most two to the sum in the previous item. 

Hence, the number of comparisons made is at least ” . 

3.2 Adversary for Sorting under Partial Information 

Here we describe the adversary for algorithms that discover a linear extension of 
a partial order P that is given to them. The adversary described below answers 
the comparison queries on-line, in a manner so as to force the algorithm to 
make at least log e{P) comparisons. The adversary is based on the following two 
theorems proved in [2] which relate the number of linear extensions of P and 
H{P). 

Theorem 1. For any partial order P , 

n(logn — H{P)) > log e{P), 
loge(P) > maxjlogn! — nH{P),Cn{logn — H{P))}, 

C = 

Theorem 2. For any partial order P and any incomparable elements x,y G P, 
min{P[{P{x < y)),F[(P{y < x))} < FI(P) + 

n 
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The Adversary. We describe the answers the adversary gives to a comparison 
query made by an algorithm that knows a partial order P. The adversary answers 
a comparison between elements x, y based on the follows cases: 

— X and y are comparable in P: The adversary answers exactly the relation 
between x and y in P. 

— X and y are incomparable in P: The adversary applies theorem 2. The ad- 
versary answers x < y if H{P{x < y)) = min{P[{P{x < y)),P[{P{y < x)). 
Otherwise the adversary answers y < x. 

In the first case, after getting the answer from the adversary, the algorithm still 
knows only P. Whereas, in the second case, the algorithm knows a new partial 
order which is either P(x < y) or P{y < x) depending on the answer given by the 
adversary. It is proved in [2] that this adversary forces any algorithm for sorting 
under partial information to use l7(loge(P)) comparisons. The key ingredients 
in the proof of this claim are the following: 

— The entropy of the comparability graph of a total order is logn. This is 
because the comparability graph of a total order is a clique. 

— By theorem 2, each comparison yields a new partial order whose entropy is 
at most - more than the entropy of the current partial order. 

— The above two observations yield a lower bound on the number of compar- 
isons, namely, (logn — H{P))f^. From theorem 1 this is at least 

3.3 A Simpler Adversary for Sorting Partial Orders 
and the Prefix Ftee Code Assignment Problem 

We now describe a new adversary for sorting under partial information. This 
adversary starts by placing the elements of X at nodes in the infinite binary 
tree. The placement is based on the partial order P. The answers to the com- 
parison queries are given using the rules of the adversary for sorting. We state 
the following theorem that describes the placement of the elements of X in the 
tree: 

Theorem 3. Let P he a partial order on a set X, |A| = n. Let (oi, . . . , a„) be 
a point in V P{G{P)) such that H{G{P)) = —logOi. Let every element of 
X be assigned to a corresponding node of the infinite binary tree satisfying the 
following properties (tree embedding condition): 

— For every x,y € X, if x and y are comparable in P, then the nodes in the 
tree assigned to x and y are unrelated in the tree partial order. 

— Element i € X is assigned to a node in the infinite binary tree at distance 
[— log ai\ from the root. 

Consider an adversary who: 

— Starts with the above placement of elements in the infinite binary tree. 

— Answers the comparison queries of any algorithm sorting P using the rules 
of the adversary for sorting. 
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This adversary will force any algorithm for sorting under partial information to 
use at least comparisons. 

Proof. As in the analysis of the adversary for sorting, when the sorting algo- 
rithm ends, the adversary would have placed each element of A to a tree node 
bijectively. The sum of distances from the root to nodes containing elements 
of X is nlogn. The distance from root of an element finally consists of two 
components: its distance from the root in the tree configuration the adversary 
starts with, and the distance increase due to comparisons. For each comparison, 
the contribution to the sum of distances increases by at most 2. Therefore, it 
follows that nlogn < X^r=i log + 2x number of comparisons. Using the 
fact that H{P) = ^ “logOj, simple arithmetic and theorem 1 it follows 
that number of comparisons is at least . 

This lower bound on the number of comparisons is of significance for partial 
orders for which e(P) is large. 



Remarks on the Tree Embedding Condition in the Theorem: The con- 
dition in the theorem requiring that two related elements in P be assigned two 
unrelated nodes in the tree partial order is same as asking for the codes as- 
signed to them to be prefix free. The question that is of interest is. Does the tree 
embedding condition hold for every partial order P with respect to the point in 
VP{G{P)) that achieves H{P)? Furthermore, there is no specific reason as to 
why the tree embedding condition should hold only with respect to a point at 
which entropy of P is achieved. 

Also, from [3] it is true that VP{G{P)) consists of exactly those vectors 
(wi, . . . , Vn) G [0, 1]" for which — 1 whenever {xi £ Xji G /} is a clique 

in G{P). This statement is true for all perfect graphs and G{P) is a perfect 
graph. The above remarks lead to the following problem: 

Let P be a partial order on a set of n elements. Let wi, . . . ,w„ be positive 
integer weights assigned to the vertices of G(P) such that G 

VP{G{P)). Do there exist binary codes Ci, . . . , C„ to the vertices such that Ci 
has Wi bits in it and, for every edge {i,j} G G{P), Gi and Gj are not prefixes of 
each other? 

The above problem can be asked for any undirected graph. For an undirected 
graph there is a simple necessary condition for the existence of prefix free codes: 
the weights (wi,...,w„) should satisfy the Kraft’s inequality. If the weights 
satisfy Kraft’s inequality then . . . , G VP{G{P)) [3]. Therefore, for 
arbitrary graphs we get the Graph Prefix Free Code assignment problem: 

Let G be a graph on n vertices, with positive integer weights wi, . . . ,w„ 
assigned to the n vertices such that the Kraft’s inequality is satisfied. Is there an 
assignment of binary codes Gi , . . . , G„ to the vertices such that Gi has Wi bits 
in it and, for every edge {*, j}, Gi and Gj are not prefixes of each other? 
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4 Hardness and Existence Results 

In this section, we show that the there is no polynomial time algorithm to decide 
if (G, {wi, . . . , Wn)) has a +0-prefix free code assignment unless P = NP. 

Theorem 4. Given a graph G, and a weight assignment {wi, . . . ,Wn) that sat- 
isfies the Kraft’s inequality, there is no polynomial (in the size of the graph) 
time algorithm to decide if {G,{w\, ... ,Wn)) has a -\-0 -prefix free code, unless 
P = NP. 

Proof. The proof is by showing that a polynomial time algorithm to decide the 
existense of a +0-prefix free code can be used to obtain in polynomial time 
algorithm for an iVP-complete problem. The problem we consider is that of 
finding in polynomial time a constant factor, less than 2, approximation to the 
minimum colouring of an undirected graph. Our approximation algorithm would 
find the smallest i,l < i < logn such that (G, (i, . . . ,i)) has a +0-prefix free 
code. This prefix free code assignment would be a 2* colouring of G. Since i is 
the smallest value for which there is a 2* colouring, this gives a constant factor, 
less than 2, polynomial time approximation algorithm. 

4.1 Existence Results 

For every positive integer c, we observe that there is a graph G and a vertex 
weight assignment such that (G, (rui, . . . , w„)) does not have a +c prefix free 
code. 

Theorem 5. For every positive integer c, there is a graph G with vertex weights 
{wi , . . . , Wn) that satisfies Kraft’s inequality, and (G, {w \, . . . , w„)) has no +c- 
prefix free code. 

Proof. We observe that there exist graphs in which the largest clique is an edge, 
but the chromatic number is arbitrarily large. In particular, for any positive 
integer c, let Gc be a graph whose largest clique is an edge and the chromatic 
number (minimum colouring number) is Let the weight assignment of Gc 

be (1, . . . , 1). This weight assignment satisfies Kraft’s inequality in Gc. Clearly, 
G does not have a +c prefix free code assignment. 

The construction of graphs with a fixed largest clique number and arbitrarily 
large chromatic number was first given by Lovasz. A construction of such a 
graph is presented in the book by Bondy and Murthy [4]. Clearly, the graph 
that we use in the construction is an imperfect graph, that is graphs which have 
an induced subgraph whose chromatic and clique number are different. We next 
consider the prefix free code assignment problem for the case when the graphs 
are perfect (meaning, not imperfect). The exploration on this class of graphs is 
well motivated given that comparability graphs are perfect graphs. 

4.2 Prefix Free Codes for Perfect Graphs 

We next show that the perfectness of a graph is insufficient for +0-prefix free 
codes to exist. 
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Fig. 1. This graph doesn’t have a +0 prefix free code for the shown weights. 

A Perfect Graph with no +0-Prefix Free Code: The graph we present 
here is an Interval Graph. Interval graphs are intersection graphs of intervals 
on a line and are known to be perfect [5]. The graph I that we construct is 
the graph in Figure 1. It can be shown that (/, (I, 2, 2, 2, 1)) does not have a 
+0-prefix free code. This construction shows that there exist perfect graphs and 
weight assignments that do not have a +0-prefix free code. We improve on this 
construction and show in the Section 5 a similar result for a sub class of perfect 
graphs namely chordal graphs. Now we use the fact that the colouring problem 
can be solved in polynomial time for perfect graphs, to assign a prefix free code 
whose lengths are within a multiplicative factor of 2. 

Prefix Free Codes within a Constant Factor of the Weight for Perfect 
Graphs Here we show that the satisfaction of Kraft’s inequality by the weights 
(ici, . . . , Wn) for a perfect graph G is sufficient for a prefix free code (Ci, . . . , C„) 
to exist such that the number of bits in Ci is at most 2wi. 

Theorem 6. Let G be a perfect graph with vertex weights {wi, . . . ,w„) that 
satisfy Kraft’s inequality. G has a prefix free code {Ci, . . . ,Cn) such that the 
number of bits in Ci is at most 2wi . Moreover, such an assignment of prefix free 
codes can be computed in polynomial time. 

Proof. We prove the theorem by constructing a prefix free code. The construc- 
tion is as follows: Let there be r distinct weights such that Wi < . . . < Wr 
among the weights {wi, . . . ,Wn). Consider the r induced subgraphs of G, de- 
noted Gi, . . . ,Gr. Gi consists of vertices whose weight is Wi in G. By definition, 
each of the graphs Gi, . . . ,Gr is perfect. We find an optimal colouring in poly- 
nomial time for each Gi. Due to the perfectness of Gi and the Kraft’s inequality, 
the optimal colouring of Gi uses at most 2™* colours. This colouring gives a 
prefix free code for each Gi in which every code has Wi bits. Now for each i, 
we prefix the codes given to the vertices in Gi by i — 1 Is followed by a 0. As 
i < Wi, a, vertex of weight Wi gets a code of length at most 2wi. Clearly, the code 
assignment is prefix free. The above construction can be run in polynomial time 
using the polynomial time algorithm for perfect graph colouring [3]. Hence the 
theorem. 
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5 Prefix Free Codes for Chordal Graphs 

We construct a chordal graph and a weight assignment for which there is no +1- 
prefix free code. This construction can be extended to give a chordal graph which 
has no +c-prefix free code for any positive integer c. We present the construction 
of the graph in a step by step manner. 

— Start with a clique on 2™ vertices, with each vertex of weight w. Clearly this 
weight assignment satisfies Kraft’s inequality. Let Go denote this graph. 

~ For each sub-clique Ki of Go with 2^ vertices, add a clique K 2 with 2^ — 1 
vertices, each of weight Each vertex in Ki is adjacent to every vertex 
in K 2 that is, Ki IJ K 2 is a clique. Let Gi denote the resulting graph. The 
weight assignment to Ki IJ K 2 satisfies Kraft’s inequality. 

— For each sub-clique Ki of Gi, that has 2 t+i vertices such that 2 t of them 
have weight w and the rest have weight ^ , add a clique K 2 with 2 t ( 1 — 

-I- ^)) vertices, each of weight As before, Ki IJ K 2 forms a clique 
and the weight assignment satisfies Kraft’s inequality. Let G denote the 
resulting graph. 

Theorem 7. G is a chordal graph, and given the weight assignment as men- 
tioned above for w >8, there is no -\-l-prefix free code. 

Proof. The proof is omitted due to lack of space. 

Summary We have motivated the graph prefix free coding problem as a 
natural problem that arises from the issue of designing adversaries for sorting 
under partial information. The cases of the problem when the graphs are com- 
parability graphs or interval graphs are of specific interest. Positive results in 
the comparability graph case would yield adversaries for sorting under partial 
information and positive results in the interval graph case could be applied to 
the Dynamic Storage Allocation problem [6][p226, SR2]. 
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Abstract. One of the most important problem in computational biol- 
ogy is the tree editing problem which is to determine the edit distance 
between two rooted labeled trees. It has been shown to have significant 
applications in both RNA secondary structures and evolutionary trees. 
Another viewpoint of considering this problem is to find an edit mapping 
with the minimum cost. By restricting the type of mapping, Zhang [7,8] 
and Richter [5] independently introduced the constrained edit distance 
and the structure respecting distance, respectively. They are, in fact, 
the same concept. In this paper, we define a new measure of the edit 
distance between two rooted labeled trees, called less-constrained edit 
distance, by relaxing the restriction of constrained edit mapping. Then 
we study the algorithmic complexities of computing the less-constrained 
edit distance between two rooted labeled trees. For unordered labeled 
trees, we show that this problem is NP-complete and even has no abso- 
lute approximation algorithm unless P = NP, which also implies that it 
is impossible to have a PTAS for the problem. For ordered labeled trees, 
we give a polynomial-time algorithm to solve the problem. 



1 Introduction 

A labeled tree is a tree whose nodes are labeled from a finite alphabet E. A rooted 
tree is a tree in which one of the vertices is designated as root. An unordered 
labeled tree is just a rooted labeled tree. An ordered labeled tree is a rooted 
labeled tree in which the children of each node are ordered, i.e., if a node has k 
children, then we can specify them as the first child, the second child, • • •, and 
the fcth child. Unless otherwise stated, all trees we consider in this paper are 
rooted labeled trees. 

Let A be a symbol not in E and a, 6 G A U {A}. We use a — >■ 6 to denote an 
edit operation. Given a tree T, we consider three kinds of edit operations on T. 
(1) Change (i.e., a,b G E): means to change a node a to b. (2) Deletion (i.e., 
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6 = A): means to delete a node a such that the children of a become the children 
of the parent of a. (3) Insertion (i.e., a = A): means to insert a node b as a child 
of some one node, say c, such that b becomes the parent of some (consecutive, 
if T is ordered) children of c. 

Let 7 be a cost function which assigns each edit operation a — >■ 6 a non- 
negative real number 7(0 — >■ b) such that (1) 7(0 — >■ a) = 0, (2) 7(0 — >■ 6) = 
7(6 — >■ a), and (3) 7(0 — f c) < 7(0 b) + 7(6 — >■ c). The above function 7 
is called a metric cost function. Let S = (si, S2, • • • , Sk) be a sequence of edit 
operations. S is called an edit sequence of transforming tree A to tree B if there 
are trees Ti, T2, • • • , T^+i such that A = T\, B = T^+i and T^+i, where 1 < t < fc, 
is obtained by applying Si to Ti. The cost of S is defined to be 7(5') = 

For any two trees Ti and T2, the edit distance between them, denoted by 
De{Ti,T2), is defined to be the minimum cost of an edit sequence transforming 
Ti to T2. That is, De{Ti,T2) = min5{7(5)|5 is an edit sequence transforming Ti 
to T2}. The problem of computing De(Ti, T2) can be considered from a graphical 
viewpoint, called edit mapping. Let V{Ti) be the set of nodes of tree T^. An edit 
mapping from Ti to T2, denoted by Me(Ti,T2), is defined to be a subset of 
V{Ti) X V{T2) such that for any two (mi,ui) and (^2,^2) in Me{Ti,T2), the 
following conditions are satisfied: 

1 . Ui = U2 if and only if vi = V2 (one-to-one mapping). 

2. ui is an ancestor of U2 if and only if vi is an ancestor of V2 (i-C., the ancestor- 
descendant relationship is preserved). 

3. (Only for ordered labeled trees) ui is to the left of U2 if and only if vi is to 
the left of V2 (i.e., the sibling order is preserved). 

See Figure 1 for an example of an edit mapping between two ordered labeled 
trees Ti and T2. We simply use Mg instead of Me(Ti, T2) if there is no confusion. 
We call a pair (u, v) G Mg a mapped line and say that u and v are covered by 
{u,v) or Mg. In fact, the edit mapping shows a way to transform T\ to T2 as 
follows. Each mapped line (u, v) indicates that u should be changed to v if m yf u; 
otherwise, u remains unchanged. The uncovered nodes of T\ are to be deleted 
and the uncovered nodes of T2 are to be inserted. For example, the edit mapping 
of Figure 1 corresponds to an edit sequence (c — >■ A, / — >■ g, A — >■ c) to transform 
Ti to T2. 

Let U and V be the sets of the uncovered nodes in Ti and T2, respectively. 
Then the cost of Mg is defined to be j^Mg) = ^)+I]«gc/ T'('“ 

+ Y^vev 7(A — >■ v). In [10], Zhang and Shasha showed that the edit sequence 
with the minimum cost corresponds to an edit mapping with the minimum 
cost. In other words, the edit distance between Ti and T2 can be restated as 
Dg{Ti,T2) = min{7(Me)|Me is an edit mapping between Ti and T2}- 

Let lca{u, v) represent the least common ancestor of nodes u and u in a tree. 
Motivated by [6], Zhang [7,8] introduced the constrained edit mapping, denoted 
by Mg, which is an edit mapping satisfying an additional condition that for any 
triple (mi,Ui), (162,^2) and (^3,^3) in Mg, lca{ui,U2) is a proper ancestor of U3 
if and only if lca{v\,V2) is a proper ancestor of U3. In [5], Richter defined the 




340 Chin Lung Lu, Zheng- Yao Su, and Chuan Yi Tang 



structure respecting mapping, denoted by Mg, which is an edit mapping satis- 
fying an additional condition that for any triple (ui, vi), (m 2, ^2) and (^3,^3) 
in Mg such that none of u\ , U2 and U3 is an ancestor of one of the others, 
lca{ui,U2) = lca{u\,u^) if and only if lca{v\,V2) = lca{v\,v^). In fact, both 
the concepts of constrained edit mapping and structure respecting mapping are 
equivalent. Each one is a restricted variant of the edit mapping which prohibits 
the edit mappings as showing in Figures 3 and 4. The mapping of Figure 2 is a 
constrained edit mapping since T\ and T2 are isomorphic. The mapping of Fig- 
ure 3 is not a constrained edit mapping since lca{u\, U2) = U4 ^ lca{u\,uz) = Ms, 
but lca{v\,V2) = V4 = lca{v\,v^), or since lca{u\,U2) is not a proper ances- 
tor of M3, but lca{vi,V2) is a proper ancestor of M3. The mapping of Figure 4 
is not a constrained edit mapping since lca{u\,u^) = M5 = lca{u2,uz), but 
lca{v\,v^) = Ms lca{v2,vz) = V4, or since lca{u2,u^) is a proper ancestor of 
Ml, but lca{v2,v^) is not a proper ancestor of mi. The constrained edit distance 
(or structure respecting distance) between two trees Ti and T2 is defined to be 
Dc{Ti,T2) = min{7(Mc)|Mc is a constrained edit mapping between Ti and T2}. 

In this paper, we define a new variant of the edit mapping, called less- 
constrained edit mapping, by relaxing the restriction of constrained edit map- 
ping such that the edit mapping of Figure 3 is allowable. The less-constrained 
edit mapping Mi is defined to be an edit mapping which satisfies an addi- 
tional condition that for any triple (mi,Mi), (m 2,M2) and (M3, M3) in Mi such that 
none of Mi,M 2 and M3 is an ancestor of the others, lca{ui,U2) < lca{ui,u^) = 
lca{u2,u^) if and only if lca{v\,V2) < lca{v\,v^) = lca{v2,vz)- For example, 
the mapping of Figure 3 is a less-constrained edit mapping since lca{u\,U2) < 
lca{u\,u^) = lca{u2,u^) and lca{vi,V2) = lca{v\,vz) = lca{v2,v^). However, the 
mapping of Figure 4 is not a less-constrained edit mapping since lca{u\,U2) < 
lca{ui,u^) = lca{u2,u^), but lca{vi,V2) = lca{vi,v^) > lca{v2,v^). We define 
the less-constrained edit distance between two trees Ti and T2 as Di(Ti,T2) = 
min{7(Mi)|M; is a less-constrained edit mapping between Ti and T2}- 

The editing (respectively, constrained editing and less-constrained editing) 
problem is the problem of computing Dg(Ti,T2) (respectively, Dc(Ti,T2) and 
Di{Ti,T2)) between two trees Ti and T2. The editing problem is shown to 




Ms Ms 




Fig. 1. The dashed lines denote an edit Fig. 2. An edit mapping which is con- 
mapping. strained and less-constrained. 
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be NP-complete [11] and even be MAX SNP-hard [9] for two unordered la- 
beled trees, but is solvable in 0(|Ti| x IT 2 I x YmYi{depth{Ti),leaves{Ti)} x 
Tm\i{depth{T 2 ).,leaves{T 2 )'\) time for two ordered labeled trees [10]. The con- 
strained editing problem can be solved in 0(|Ti| x IT 2 I x {degree(Ti)+degree(T 2 )) 
X log(de(/ree(Ti) -|- degree{T 2 ))) time for unordered labeled trees [8] and in 
0(|ri| X IT 2 I) time for ordered labeled trees [7]. In this paper, we show that 
the less-constrained editing problem for unordered labeled trees is NP-complete 
and even has no absolute approximation algorithm, unless P = NP. The neg- 
ative result of absolute approximation also implies that it is very unlikely to 
have a PTAS for the problem. In fact, we can show this problem to be MAX 
SNP-hard. However, this result does not necessarily imply that it has no abso- 
lute approximation algorithm. For ordered labeled trees, we give an algorithm 
of 0(|ri| X IT 2 I X degree^{Ti) x degree^{T 2 ) x {degree{Ti) + degree{T 2 ))) time 
to solve it. 

2 NP-Completeness Result 

LCEP (Less-Constrained Editing Problem on Unordered Labeled Trees) 
Instance: Two unordered labeled tree T\ and T 2 and a positive integer K. 
Question: Is the less-constrained edit distance between Ti and T 2 < K1 
X3C (Exact Cover by 3-Sets Problem) 

Instance: A finite set X with |A| = 3n and a collection S of 3-element subsets 
of X with |5| = m. 

Question: Does S contain an exact cover for A, i.e., a subcollection S' C S 
such that every element of X occurs in exactly one member of 5? 

Note that X3C is a NP-complete problem [3]. Let X = {x\,X 2 , - ■ ■ ,xzn} 
and S = {S'!, S' 2 , • • • , S'm} be an instance of X3C. Without loss of generality, we 
assume that m > n. Then we transform X and S into an instance of LCEP, 
Ti and T 2 , as shown in Figure 5 and 6, respectively. For convenience, we let 
Si = C.2i ^*, 3 } for 1 < i < m and call a node with label a an a-node. 

The root of T\ is an i?-node which has m A-nodes as the children to corre- 
spond to Si,S 2 , - ‘ ‘ Sm of S. Each S'-node has three children which are labeled 
and respectively if it corresponds to Si. The root of T 2 is an i?-node 
and has m -I- 2n children in which 3n children are labeled xi,X 2 , ■ ■ ■ , x^n respec- 




Fig. 3. An edit mapping which is less- Fig. 4. An edit mapping which is nei- 
constrained, but not constrained. ther constrained nor less-constrained. 
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^1,1 il,2 il,3 ^2,1 h,2 h,3 ^m,l ^m,2 ^m,3 

Fig. 5. The unordered labeled tree Ti . 



R 




Fig. 6. The unordered labeled tree T 2 - 



tively and other m — n children are 5'-nodes. Each 5'-node has three j/-nodes as 
the children. Clearly, the construction of Ti and T 2 can be accomplished in poly- 
nomial time. For convenience, we assume that the cost of each edit operation is 
1, i.e., 7(u — >■ u) = 1 if u yf u. 

Lemma 1. Let M be any less-constrained edit mapping from T\ to T 2 . Suppose 
that there are k, 1 < k < IT2I, uncovered nodes in T2. Then '){M) > 3m-2n-\-k. 

Proof. Let U and V be the sets of the uncovered nodes in Ti and T2, respectively. 
Then \V\ = k. Since |Ti| = 4m -I- 1, IT2I = 4m — n-\- 1 and M is a one-to-one 
mapping, \M\ = |T2\F| = 4m — n — /c-l-l and \U\ = iTij — \M\ = n-\-k. For each 
uncovered node in U (V), it corresponds to a delete (insert) operation with cost 
1. Hence, U and V totally contribute a cost n-\-2k to 7(M), i.e., 7(M) > n-\-2k. 
If fc > 3m — 3n, then n 2k > 3m — 2n k and hence ”f{M) > 3m — 2n -\- k. 
Suppose that k < 3m — 3n. Then T 2 has at least (3m — 3n — k) j/-nodes covered 
by M. Since Ti has no y-node, each covered y-node corresponds to a change 
operation with cost 1. That is, the covered y-nodes in T2 contribute a cost at 
least 3m—3n—k to 7(M). Totally, U, V and the covered y-nodes in T2 contribute 
a cost at least 3m — 2n-\- k to y(M). Hence, 'y(M) > 3m -2n-\-k. □ 



Theorem 1. LCEP is an NP-complete problem. 

Proof. (Sketch) Given an instance X and S of X3C, we create an instance Ti 
and T 2 of LCEP as described in the beginning of this section. First, suppose 
that S has an exact cover S' . Then we find a less-constrained edit mapping M 
from Ti to T 2 as follows. (1) i?-node of Ti is mapped to i?-node of T 2 . (2) If Si 
is in S' , then we map the children of its corresponding S'-node in T\ to t^p-node, 
ti, 2-node and t^^s-node in T2. Note that the S'-node is not mapped. (3) If Si is not 
in S', then the subtree of Ti rooted at the corresponding S-node of Si is mapped 
to a subtree of T 2 rooted at an S-node in a way that root is mapped to root and 
leaf is mapped to leaf. Clearly, the above mapping M is a less-constrained edit 
mapping and Di{Ti,T 2 ) < 3m — 2n. 

Conversely, suppose that there is a less-constrained edit mapping M from 
Ti to T2 with cost < 3m — 2n. By Lemma 1, all nodes in T 2 are covered by M 
and hence T\ contain n uncovered nodes. Then the uncovered nodes of Ti and 
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the covered y-nodes of contribute a total cost 3m — 2n to 7 (M), which leads 
to 7 (M) = 3m — 2n. Hence, i?-node of T 2 is mapped to i?-node of Ti, S'-node 
of T 2 is mapped to S'-node of Ti, and ccj-node of T 2 is mapped to x^-node of 
T\, 1 < i < 3n. Let S' and S" denote any two S-nodes, which are mapped to 
each other, in Ti and T 2 , respectively. Then the children of S" (i.e., 3 y-nodes) 
are all mapped to the children of S' since the ancestor-descendant relationship 
is preserved by M. Hence, the uncovered nodes in Ti are n S-nodes and their 
children are mapped to xi-node, X 2 -node, • • •, X 3 „-node in T 2 , which implies that 
these n uncovered S-nodes of Ti correspond to an exact cover of 5. □ 

3 Negative Result for Absolute Approximation 

Recall that Zhang and Jiang [9] showed that the editing problem for two un- 
ordered labeled trees is MAX SNP-hard by a reduction from the maximum 
bounded covering by 3-sets, which is known to be MAX SNP-hard [4]. In fact, 
their proof can be applied to show that the less-constrained editing problem for 
two unordered labeled trees is MAX SNP-hard. Note that any MAX SNP-hard 
problem has no PTAS (Polynomial Time Approximation Scheme), unless P = 
NP [1], where a problem has a PTAS if for any fixed e > 0, the problem can 
be approximated within a factor of 1 -I- e in polynomial time. Hence, it is very 
unlikely for the less-constrained editing problem to have a PTAS. However, this 
negative result does not imply that there is no polynomial-time absolute ap- 
proximation algorithm for the problem. An approximation algorithm is absolute 
if there exists a constant c such that for every instance I of the problem, the 
absolute error \APP{I) — OPT{I)\ < c, where APP{I) and OPT{I) are the ap- 
proximate and optimal solutions of /, respectively [2] . In this section, we propose 
a technique of tree composition to show that the less-constrained editing problem 
for two unordered labeled trees has no polynomial-time absolute approximation 
algorithm, unless P = NP. For convenience, we assume that the cost of each edit 
operation is 1, i.e., ^{u v) = Hi v. For any two forests F\ and F 2 , we use 
Di{Fi,F 2 ) to denote the less-constrained edit distance between them, which is 
defined analogously to the case of two trees. 

Lemma 2. Let Fi and F 2 be any two forests of unordered labeled trees. Then 
we have ||F'i| — IF 2 II < A(Fi, F' 2 ) < |J^i| + |^ 2 |- 

Given any k unordered labeled trees Ti, T 2 , • • • , As, the composition of Ti,T 2 , 
• • • , Ac, denoted by A o T 2 ° ° Ac, is defined to be an unordered labeled tree 

which is obtained from A, T 2 , • • • , Ac by introducing a new root R such that the 
root of each Ti, 1 < i < k, is & child of R (see Figure 7). For convenience, we 
define A'' = A o A o • • • o A iff A = A = • • ■ = A- 

Let A and A be any two unordered labeled trees and k be any positive 
integer. Then we construct two new unordered labeled trees = {Tio F[)^ and 
G 2 = (A o iL)*, where iJ is a complete binary tree with {2k + 1)(|A| + |A|) 
y-nodes (see Figure 8 for an example of Gi). In each Gi, i = 1,2, the root has k 
children and the subtree rooted at each such child is Ti o H. We denote by Fi 
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the collection of k subtrees Tio H rooted at the children of the root of Qi. It is 
not hard to verify the following two lemmas. 

Lemma 3. A(f/i,f/ 2 ) < kDi{Ti,T 2 ). 



Lemma 4. Let M be an arbitrary less-constrained edit mapping with the min- 
imum cost from Gi to G2 ■ Then each subtree of T\ is mapped to exactly one 
subtree of by M. 

Lemma 5. > kDi{Ti,T 2 ). 

Proof. (Sketch) By Lemma 4, each subtree A = Ti o H of is mapped to 
exactly one subtree B = T2 o H oi T2 hy an optimal mapping M . Moreover, 
root{A) and root{B) are mapped each other. We use Ha and Hb to denote H 
of A and H of B respectively. Suppose that Ha is not covered by M. Then 
-y{M) > \Ha\ > kD[{Ti,T2), a contradiction. Case 1: Ha is mapped to exactly 
T2. By Lemma 2, y{M) > \Ha\ — IT2I > kDi(Ti,T2), a contradiction. Case 2: 
Ha is mapped to exactly Hb- The distance between A and B must be at least 
Di{Ti,T2). Case 3: Ha is mapped to both T2 and Hb- Then root{HA) and Ti 
are not covered by M. Hence, there are at least |Ti| -\- 1 uncovered nodes in A 
and at least \Hb\ + IT2I — \Ha \ {root{HA)}\ = IT2I -h 1 uncovered nodes in B, 
i.e., the distance between A and B is at least |Ti| -h |T2| -h 2 > D[{Ti,T2). □ 

By Lemmas 3 and 5, we have the following theorem. 

Theorem 2. Di{Qi,Q2) = kDi{Ti,T2) for any two unordered labeled trees T\ 
and T2- 



Theorem 3. If there is a polynomial-time absolute approximation algorithm 
of determining the less-constrained edit distance between two unordered labeled 
trees, then P = NP. 

Proof. Suppose that the problem of determining D;(Ti, T2) can be approximated 
by an absolute approximation algorithm A with absolute error c in polynomial 
time 0((|Ti|-|-|r2|)^) for some constant [ 3 . Then we let k = c-\-l and create Gi = 
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(Ti oH)^ and Q2 = (T2 as described previously with each size 0 (fc^(|Ti| + 

|T2|))- By applying A to and Q2, we can obtain an approximate solution 

APP(Qi,g2) of Di(Qi,Q2) in time + \T2\Y), which is polynomial in 

| 7 "i| + \T2\ since k and (D are fixed, such that \APP{Qi,Q2) — A(t/i,t/2)| < c. 
By Theorem 2 , we have |^PP(ai, 02 ) - kDi{Ti,T2)\ < c ^ _ 

Di{Ti,T2)\ < f. Let M denote the corresponding mapping of APP{ 0 i,Q 2 ). 
Then each subtree Ti o H of fFi is mapped to exactly one subtree T20 H of T2 
in M\ otherwise, APP{Qi,Q2) > |Tj o iL| > \H\ = ( 2 fc + l)(|Ti| + |T2|) (where 
f = 1 or i = 2 ), which leads to the absolute error \APP{Qi,Q2) — Di{Qi,Q2)\ > 
{k+ l)(|Ti| + |T2|) > c, a contradiction. We use M' C M to denote the mapping 
from Ti o H to T2 o H with the minimum cost among k such mappings and use 
Hi and H2 to denote H's oiTioH and T2 o H, respectively. Clearly, the cost of 
M' < Note that Hi must be covered by M' . Case 1 : Hi is mapped 

to exactly T2. Then APP{Qi,Q2) > |iLi| — II2I > 2 fc(|Ti| + IT2I), which leads to 
\APP{Qi,Q 2) - Di{Qi,Q2)\ > k{\Ti\ + |T2|) > c, a contradiction. Case 2 : Hi is 
mapped to exactly H2- In this case, we use the mapping of M' between Ti and T2 
as the approximate solution of finding the optimal mapping from Ti to T2, i.e., 
APP{Ti,T 2) < j{M') < Case 3 : Hi is mapped to both T2 and H2- 

Then M' does not cover Ti and hence j{M') > |Ti| + IT2I + |iL2| — |L^i| = 1^11 + 
|T2|- In this case, we use 0 as the approximate solution of finding the optimal 
mapping from Ti to T2, i.e., APP{Ti,T2) = \Ti\ + IT2I < j{M') < 

Hence, we find an approximate solution APP{Ti,T2) of Di{Ti,T2) according 
to APP{Qi,g2) such that APP{Ti,T2) < Then \APP{Ti,T2) - 

Di{Ti,T2)\ < \^^^^§^-Di{Ti,T2)\ < f = ^ < 1 . Since both APP{Ti,T2) 
and Di(Ti,T2) are integer-valued, APP(Ti,T2) = Di(Ti,T2), which means that 
the less-constrained editing problem is solvable by A in polynomial time and 
hence we have P = NP by Theorem 1 . □ 



4 A Polynomial-Time Algorithm 
for Ordered Labeled Trees 



We use D{Ti,T2) instead of Di{Ti,T2) if no confusion arises. Given an ordered 
labeled tree Ti, for convenience, we use the left-to-right postorder numbering 
to number the nodes of Ti from 1 to \Ti\. Following the notation used in [ 7 , 8 ], 
we use ti[k] to denote the /cth node of Ti, Ti[k] to denote the subtree rooted at 
ti[k] and Fi[k] to denote the ordered forest obtained by deleting ti[k] from Ti[k]. 
Note that the left-to-right order of the trees in Fi[k] is significant. For any two 
forests Fi[i] and ^2^], we use D{Fi[i], F2[j]) to denote the less-constrained edit 
distance between them. Let 0 denote the empty tree. Throughout the rest of 
this section, we let 1 < t < |Ti| and 1 < j < IT2I and let ti[fi], ti[t2], • ’ ’ h[*nj 
be the children of ti [z] and t2 [ji] , ^2 [^2] ,■ “h [juj ] be the children of t2 [j] ■ Due to 
the limitation of space, some proofs are omitted in this section. 
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Lemma 6. (1) D{0,0) = 0, (2) D{Fi\i],0) = ElLi D{Ti[ik],0), D{Ti[i],0) 
= i?(FiW,0) +7(ti[*] ^ A), (3) D{0,F2[j]) = ElLiD{0,T2[jk]), D{0,T2[j]) 
= D{0, F2 [j\) + 7(A -)> t2\j]). 

Lemma 7. Let M he a less- constrained edit mapping from Ti[i] to 72 [j]- (1) If 
both ti[i] and t 2 [j] are not covered by M, then M U {(ti[*],t 2 [j])} is a less- 
constrained edit mapping. (2) If both ti[i] and t 2 [j] are covered by M, then 
(ii[*],^2b1) G M. 



Lemmas. D{Ti[i],T 2 [j\) = min{D(0,T2[j])-hmini<fe<„^{D(Ti[i],r2bfe])- 
£)(0, r2bfe])},L’(T’i[*],0)+mini<fc<„,{iA(Ti[zfc],T2b])--D(T'i[ife],0)},7(ti[i] -)> 

t2[j\) + D{F,%F2\j])}- 

Proof. (Sketch) Let M be an less-constrained edit mapping with the minimum 
cost from Ti[i] to T 2 b], i-e., -D(Ti[i], T 2 b]) = l{M). By Lemma 7, we can com- 
pute 7 (M) by just considering the following three cases. Case 1: ti[i] is covered 
by M, but t2b] is not. Case 2: t2b1 is covered by M, but ti[i] is not. Case 3: 
t\[i] and t2b1 both covered by M. □ 

Suppose that M is any arbitrary less-constrained edit mapping from Fi[i\ 
to Jbb1- ^i[*] ^2b1> ii^ is mapped to more than one other tree 

by M, then we call M a 1-to-l less-constrained edit mapping; otherwise, we 
call M a 1-to-many less-constrained edit mapping. We use F 2 b]) and 

-D2(i^i[z], i^2b1) i-o denote the minimum costs of 1-to-l and 1-to-many less- 
constrained edit mappings from Fi[i] to ^2b1> respectively. Clearly, we have 

Z4(Fi[z],F2b1) = min{Z4i(Fi[t],F2b1),A(i^iW,i^2b])}- 

Lemma 9 . L>i (i^i [z] , F 2 b1 ) = min{ Hi ( [z] , T 2 b'l ] ■ • ■ ?2 b"i - 1 ] ) + -D ( 0 , 72 b'n,- ] ) , 
71i(7i[zi] • • • 7i[zrij-i], HzbD + 71(ri[z„J, 0 ), Hi(Ti[zi] • • • Ti [z„^_i], T 2 b’i] • • • 
T2bni-l])+i7(Ti[z„J,r2b„i])}. 

Lemma 10. Let M he a 1-to-many less-constrained edit mapping from Fi[i] to 
72 b1- 7/Ti[zfe] of Fi[i] (Tjibfc] of F 2 [j]) is mapped to more than one tree of F 2 [j] 
(Fi[z]), then ti[ik] (t 2 [jfc]) is not covered by M. 

Proof. (Sketch) Let ti[ai] and ti[a 2 ] be any two nodes in Ti[zfe] such that they 
are mapped to ^ 2 ^ 1 ] of T2bp] ^ 2 ^ 2 ] of Tj^bg] respectively. Suppose that 
ti[ik] is mapped to t 2 [b 3 \. Then, due to the ancestor-descendant relationship of 
M, t 2 ba] must be a common ancestor of t 2 [bi] and t 2 [^ 2 ], which contradicts to 
the fact that 72 [jp] and T 2 bq] are two disjoint trees. □ 



Lemma 11. Let M he a 1-to-many less-constrained edit mapping from 7\[z] to 
72 b1 ond let Ti[ik] of Fi[i] be mapped to 72bi] ’ ’ ’ 72br]> 1 < I < r < nj. (1) 
For any 1 < p < k, Ti[ip] can only be mapped to 72 [ji] ’ ’ ’ Tj^bz-i]- (2) For any 
k < q < Ui, Ti[iq] can only be mapped to T 2 br-i-i] ■ • • 72bni]- 
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Proof. (Sketch) (1) Let ti[a\] and ti[o2] of Tx[ik] be mapped to ^2(^1] of T^lji] 
and t2[b2] of T2[jr] respectively, and a\ < 02- Let ^1(03] be a node of Ti[ip] that 
is mapped to t2[bf\. Then 03 < oi < 02 and hence 63 < 5i < 62- If ^2(^3] is a 
node of 12 [ji], then Icd{t2[b'i],t2[bi\) < Icd{t2[bf\,t2[b2]) = Icd{t2[bi],t2[b2]) and 
?C(i(ti[a3],ti[ai]) = /c(i(ti[a3], ti[a2]) > lcd{ti[ai],ti[a2]), a contradiction □ 



Lemma 12. Let M he a 1 -to-many less-eonstrained edit mapping from Fi[i] to 
F2[j] and let T2[jk] of F2[j] be mapped to Ti[b] • • • 1 < ^ < r < nj. ( 1 ) 

For any 1 < p < k, T2[jp] can only he mapped to • • • Ti[ii_i]. (2) For any 

k < q < Uj , T2[jg] can only he mapped to Ti[v+i] • • • Tl[in^]■ 



Lemma 13. D 2 {Fi[i], F 2 [j]) = 



min 



/ 


^ii(Ti[zi]...Ti[*fe_i],r2[ji]---r2[jz_i]) ' 


' 


min < 


+D{Fi[ik\,T2[jl] ■ ■ ■ T2[jr\) + 7(^1 [*fc] ^ 






+D{Ti[ik+i\ ■ ■ ■ T2 [jV+i] • • • T2[jrij\) 






zi(Ti[*i]...Ti[zz_i],r2[ji]---r2[jfc_i]) 


/ 


min < 


+D{Ti[ii] ■ ■ ■Ti[ir],F2[jk]) + 7 (^ ^^[jk]) 






^ +D{Ti[ir+l] ■ ■ ■ Ti[z„J, T2bfe+l] • • • Tzb'nJ) ^ 


/ 



By above lemmas, we design the following algorithm to solve the problem in 
0(|Ti| X IT2I X degree^{Ti) x degree^{T2) x (degree{Ti) +degree{T2))) time. 



1: D{0, 0 ) — 0, Di ( 0 , 0 ) = 0, 1 ) 2 ( 0 , 0 ) = 0; /* Initialization */ 
for i = 1 to |Ti| do 

D(Fi[i], 0 ) = Efcii D{n[ik], 0); D{n[i], 0 ) = D(Fi[i], 0 ) + 7 (L[i] ^ A); 

for j = 1 to |T 2 | do 

D{0,F2[j]) = E^ii D(0,T2lik]y, D{0,T2[j]) = D{0,F2[j])+'y{\ ^ t2[j]); 
2: for i = 1 to |Ti| do /* The computation of Z)(Ti [i], T 2 [j]) */ 
for j = 1 to IT 2 I do 
for Pi = 1 to Hi do 
for p 2 = Pi downto 1 do 
for gi = 1 to Uj do 
for Q 2 = qi downto 1 do 

/* Let Fi = Ti[ip^] • • • Ti[ipJ and F 2 = T 2 [ig^] • • • T 2 [fqJ. */ 
Compute Di{Fi, F 2 ) by Lemma 9 and D 2 {Fi, F 2 ) by Lemma 13; 
D{Fi, F 2 ) — min{Di (Ti, F 2 ), D 2 {Fi, T 2 )}; 

end for 

Compute D(Ti[i],T 2 [ji]) by Lemma 8; 

end for 
end for 

3: Output D(ri[|ri|],r 2 [|T 2 |]). 
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A Highly Efficient Algorithm 
to Determine Bicritical Graphs* 
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Abstract. In this paper, we show a necessary and sufficient condition 
which characterizes all bicritical graphs. Using this necessary and suf- 
ficient condition we develop a highly efficient algorithm to determine 
whether a graph is bicritical, of which the time complexity is bounded 
by O (|U| |i?|), and this is the best result that has ever been known. 



1 Introduction and Terminology 

All graphs considered in this paper are finite, undirected, connected and simple. 
In general, we follow the terminology of [1]. A graph G is said to be bicritical if 
G — u — v has a perfect matching for each pair of vertices u and v in G such that 
u =/= V. Plummer[10] introduced the concept of n-extendable graphs. Let G be a 
graph on v vertices and n be an integer such that 0 < n < (u — 2) /2. Then G is 
n-extendable if every matching of size n in G is contained in a perfect matching 
of G. 

Bicritical graphs play a central role in the decomposition theory of graphs in 
terms of their maximum matchings. In [10], it is proved that every n-extendable 
graph is also (n — I)-extendable and also that any 2-extendable graph is either 
1-extendable bipartite or bicritical. Thus for non-bipartite graphs, if we let Gk 
denote the class of those which are fc-extendable and G{, those which are bi- 
critical, we have a nested sequence of graph classes: Gi D Gb D G2 D G3 — 
Moreover, it is known that each of the containments is proper. In [7], Lovsz and 
Plummer point out that the structure of bicritical graphs is far from completely 
understood. We show a necessary and sufficient condition to characterize all 
bicritical graphs in this paper. 

The concept of bicritical graphs is also generalized to n-critical graphs and 
there has been some research on this topic (see [2]-[6]). In [12], Yu gives a Tutte 
style necessary and sufficient condition for n-critical graphs. However, it does not 
help to design an efficient algorithm to determine the n-critical graphs. Using 
our necessary and sufficient condition, we can design a highly efficient algorithm 
to determine all bicritical graphs. 

* The work of this paper was supported by the National Science Fonndation of China. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 349-356, 2001. 

(c) Springer- Verlag Berlin Heidelberg 2001 
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2 A Necessary and Sufficient Condition 
for Bicritical Graphs 

In this section, we give a necessary and sufficient condition for bicritical graphs. 

Theorem 1. Let G he a graph with a perfect matching Mq, then the following 
propositions are equivalent: 

1. G is bicritical; 

2. For any perfect matching M and any two different vertices x and y in G, 
there is an M-alternating path between x and y, which starts and ends with 
edges in E (G) \ M 

3. For every pair of vertices x and y in G, there is an Mo-alternating path 
between x and y, which starts and ends with edges in E {G)\ Mq. 

Proof. We show that (1) (2) (3) (1). 

1. (1) (2): Suppose that G is bicritical, then for any perfect matching M in 

G and any two vertices u,v GV (G), G — {u, f } has a perfect matching. For 
any pair of vertices x,y &V (G), we discuss in two cases: 

Gase 1. xy ^ M. Suppose that xu, yv G M. G — {u, v} has a perfect match- 
ing. Let Ml = M — {xu,yv} be a matching in G — {M,f}, then both x 
and y are Mi-unsaturated. So, there must be an Mi-augmenting path be- 
tween X and y, otherwise, according to Berge’s theorem. Mi is the maximum 
matching in G — {u, f }, which contradicts the fact that G is bicritical. 

Gase 2. xy G M. For yw £ E (G) \ M, since G is 1-extendable, yw is con- 
tained in a perfect matching M' . Let Gi = G[MAM'], where MAM' de- 
notes the symmetric difference of M and M' . Then the component of Gi 
that contains xy is an even cycle with edges alternatively in M and M', and 
we have an M-augmenting path between x and y as desired, (see Figure 2). 

2. (2) ^ (3) follows trivially since (3) is a special case of (2). 

3. (3) (1) For any two vertices u,vinV (G), if uv G Mq, then G — {m, v} has 

a perfect matching Mq \ Suppose uv ^ Mg, and xu, yv G Mq. By (3), 

there is an Mg-alternating path P between vertices x and y, which starts 
and ends with edges in E (G) \ Mg, and P is an Mg \ {xu, yr^j-augmenting 
path in G — {u,v}. By exchanging edges in Mg and edges not in Mg along 
P, we have a perfect matching in G — {r(,f}.Thus, G is bicritical. 

We call such an M-alternating path P between two M-saturated vertices x and 
y, which starts and ends with edges in E (G) \ M a pseudo-M-alternating path. 




Fig. 1. G [MAM'\, M broken lines, M' solid lines. 
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3 Description of the Algorithm 

First, we can use an O {^/n |i?|) algorithm at hand to find a perfect matching Mq 
in a given graph G (see [8]), if such Mq cannot be found, then G is not bicritical, 
and we can make our conclusion at once. Otherwise, for any two vertices, say, 
X and y, we try to find an Mo-alternating path between them in G, which 
starts and ends with edges in E (G) \ Mo-This should be done for all pairs of 
vertices, and if there exists such a path between every pair of vertices in G, 
then according to Theorem 1, G is bicritical, otherwise, G is not bicritical. To 
determine bicritical graphs, we develop an algorithm, MAPS(a;, exist) in the 
main recognition program to find all the Mo-alternating paths which start and 
end with edges in E (G) \Mq in G between a given vertex, say x and every other 
vertex in G in O (|il|) time. 

The main recognition program to determine bicritical graphs 

1. exist:=true; // 0(1) time 

2. Construct an adjacency list for G; // O (|i?|) time 

3. Try to find a perfect matching Mq in G; j jO^yJn |i?|) time 

4. If such Mq does not exist then exit; (G is not bicritical) 

5. else for each of |y| — 1 vertices in G, say x do 
begin 

6. MAPS(x, exist); // pseudo- Mo-alternating path search procedure 

7. If exist=false then exit (G is not bicritical’) // 0(1) time 

end 

8. Exit (G is bicritical) // 0(1) time 

Our algorithm MAPS procedure is derived from the idea inspired by Edmond, 
there are some differences though, mainly in the handling of blossoms and in 
the way we extend an Mo-alternating tree. We use an adjacency list to store 
G. Suppose that all vertices have indices in sequence, and for all vertices, there 
exists a table in the storage, which is of the form below. 



Vertex (z) 


1 


2 


3 




n 


m 












f 












b 












p 













In this table, for any given vertex i, m (i) denotes the other vertex adjacent to 
i with an edge in Mq. f (z) represents the father of i, that is, in the construction 
of the alternating tree, the predecessor of i, and b (z) is the root of the blossom 
that contains z. Finally, if there is a pseudo-Mo-alternating path between x 
and z when constructing an Mo-alternating tree rooted with x, then p (i) = 1, 
otherwise p (i) = 0. Initially, let / (z) := 0; b (z) := z; p(i) = 0 for all z. 

Construct the adjacency list and the table for G, which needs O {\E\) time. 

An alternating tree with root x with respect to a matching M is a subgraph 
of G, Tree = (V,T), which is connected and without a cycle, such that: 
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1. X G V is the root of the tree, which is M-unsaturated; 

2. for each i G V, the unique path Lt (i) of the tree between x and i is an 
alternating path. 

The alternating tree of Figure 3 illustrates this definition. 




11 12 



Fig. 2. An alternating tree with root x = \. Heavy edges represent matching 
edges 

We shall say that the vertices of G which belong to V are labelled. The 
vertices of V (G) — V are called unlabelled. Let i G V he a labelled vertex. 

— If the path Lt (i) has an even number of edges, then we say that the vertex 
i is even. In particular, x, the root of the tree is even. 

— If the path Lt (?) has an odd number of edges, then the vertex i is said 
to be odd. In an alternating tree, the odd vertices must all be of degree 
2: this is precisely because there is always exactly one thin and one heavy 
edge incident to such a vertex. What’s more, it’s crucial to point out that if 
vertex i is labelled odd, then there must exist a pseudo- Mo-alternating path 
between x and i. 

— If one walks through an alternating tree from a vertex i to the root x, then 
the vertices which one meets are alternatively even and odd. 

Now, we are ready to introduce our pseudo- Mo-alternating path search procedure 
(MAPS) between x and every other vertex in G. We use breadth-first-search 
method (BFS) to handle every vertex in G. In each MAPS procedure, we need 
an even vertices queue Vq to store even vertices. Denote hyV^dV (G) the subset 
of vertices labelled even. The construction of the alternating tree will proceed 
step by step, starting from 



P=Po = W,T=0 



At each stage. Tree = {V, T) induces an alternating tree on G. The MAPS 
procedure is as follows: 

The pseudo- Mo-alternating path search procedure (MAPS) between x and 
every other vertex in G. 

1. Set the even vertices queue Vq empty; 

2. Store x in even vertices queue Vbjlabel x even and set p{x) := 1; 
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3. Remove the first vertex i from Vq; 

4. Use BFS to handle i, that is, for every edge e = {i,j) which is incident to i 
and j is not labelled odd, two cases are possible: 

5. Case 1: j is unlabelled; 

Label j odd; 

Set p(j) := 1; 

Get m (j) from the table; 

If m (j) is unlabelled then 
begin 

Add rn{j) to the even vertices queue Vq! 

Write / (j) := i, / (m (j)) := j to the table; 

Label m (j)even; 

end 

end of case 1; 

// T + {e} + {jm(j)} 

I ! V + {j} + {m{j)} 

I IVq ^ Uo + 

6. Case 2: j is also labelled even {j G Vq) 

7. While i' yf b{i') or j' yf b{j') do 

begin 

i' :=b{i'); 

/:=&(/); 

end 

If i' = j' then we do nothing else 

begin 

k:= 

8. While (neither i' nor j' has been marked) and {i' yf j') do 

begin 

Marked i' and j' 

:=M/ (*')); 

r :=&(/(/)); 

end 

If j' has been marked then r := j' 

else r = i' 
end 

Note 1. r, which has at least degree 3 (unless r = x), must be labelled even. 
The subpath Li and Lj of Lt (i) and Lt (j), on the one hand between r and 
i, and on the other hand between r and j, are even and jj, = Li + Lj + {e} 
is certainly of odd cardinality. The matching M/i = Mq fi /i is a maximum 
matching of /i and r, the only M /x-unsaturated vertex of the cycle, is linked 
with X by an even alternating path Lj^ (r) . Such a cycle /x is called a blossom 
of Mo, and the vertex r is called the root of the blossom. When a blossom 
has been found, each vertex in it must have a pseudo- Mo-alternating path 
that links it to the root x. Now we use a supplementary queue V\ to store 
the vertices of /x that are labelled odd and notice that we need not make 
any changes to those vertices that have already been included in some other 
blossoms. This can be done as follows. 
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9. While k r do 
begin 

If k is labelled odd then 

begin 

Put k into Vi, 

Re-label k even 

end 

else p (k) := 1; 
b (fc) := r 

Write b (k) to the table; 
k:=f{k) 

end 

10. While l^r do 

begin 

If I is labelled odd then 

begin 

Put I into Vi; 

Re-label I even 

end 

else p (1) := 1 
b{l) := r; 

Write b{l) to the table 

end 

11. Add Vi to the tail of Vb; 
end of case 2; 

12. If Vb is not empty then go to 4 

13. for every vertex i of G do 

if p{i) = 0 then exit(exist:=false); 

14. exit(exist:=true). 

4 The Correctness and the Time Complexity 
of the Program 

Lemma 1. If there exist pseudo-MQ-alternating paths between x and every other 
vertex y in G, then MAPS can find them. 

Proof. If y is labelled odd then the lemma trivially follows. If y is labelled even 
then only when it is contained in a blossom, there is a pseudo-Mg-alternating 
path between x and y. The last case is that the Mg-alternating tree rooted with 
X fails without reaching y. In this case, we prove that there is no pseudo-Mg- 
alternating path between x and y in G as follows: 

Let u, V be the vertices that are adjacent to x and y respectively, with 
matching edges. If there exists an Mg-alternating path between x and y in G — 
{u, u}, then it must be an Mg-augmenting path. When the alternating tree with 
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root X fails to reach y, the matching is maximum (with only x and y Mq- 
unsaturated) . Thus, according to Berge’s theorem, there is no Mo-augmenting 
path in G — {u, u}. So the lemma follows. 



Theorem 2. The program that determines whether any given simple graph G 
is bicritical is correct. 

Proof. For every pair of vertices x and y in G, the program is to check whether 
there exits a pseudo- Mo-alternating path between x and y, which starts and ends 
with edges in E (G) \ Mo- Thus, the theorem is a direct result from Theorem 1 
and Lemma 1 



Theorem 3. In the worst case, the time complexity of the program is bounded 

byO{\V\\E\). 

Proof. In the following proof, denote \V\ by N. First, let’s look at the main 
recognition program to determine bicritical graphs: obviously, program sections 
5-7 are a loop that checks each vertex in G, in the worst case, we must construct 
N — 1 alternating trees. The problem is therefore that of determining the number 
of elementary operations (additions, comparisons, and access to storeage) needed 
to construct an Mo-alternating tree, that is, the time complexity of the procedure 
MAPS. 

The structure of the graph G is supposed to be described by an adjacency 
list giving, for each vertex i, those vertices which are adjacent to i. Each edge 
e = (b j) appears thus twice in the list. 

In the procedure MAPS, program sections 1-3 need 0(1) time. Each edge 
is examined at most twice in our BFS in accordance to their appearance in the 
adjacency list. So the number of operations that work on the adjacency list is 
proportional to \E\. 

When a blossom p.i has been detected, the number of operations necessary 
to change the labels and the updates b (i) of the vertices of the blossom is pro- 
portional to \yLi\. Notice that the number of blossoms which might appear in the 
course of the construction of an alternating tree is at most [IV/2J (the integer 
part of iV/2). Since every vertex is handled at most once in a blossom, the total 
sum of |/ii|, i from 1 to v, is at most iV -|- A^/2 {N/2 is the maximal number 
of the roots of all blossoms). Thus, the number of all operations of this type is 
proportional to N . 

For those edges that won’t form blossoms when they are added to the al- 
ternating tree, the time complexity over all is 0(k), k is the number of such 
edges, and for those edges that will form blossoms when they are added to the 
alternating tree, the time complexity over all is O {N + N/2), thus in the pro- 
cedure MAPS from program sections 4-12, we need at most O {k + N + N/2) 
time, that is, O {\E\) time to construct an Mo-alternating tree. Section 13 needs 
O (N) time and section 14 needs O (1) time, so our MAPS needs O (|E |) time, 
and for all N alternating trees, we need O {N |E|).Thus the theorem follows. 
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5 Remarks 

If we use the algorithm derived from the definition of bicritical graphs, that is, 
choose every pair of vertices x and y oi G, delete x and y together with edges 
incident to them, and then check whether G—{u, u} has a perfect matching, then 
the time complexity of such an algorithm will come out as O which 

is obviously inferior to the algorithm we get above, with the space complexity 
almost the same. So, our algorithm has higher efficiency. 
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Abstract. We study the problem of producing hierarchical drawings of 
layered graphs when some pairs of edges are not allowed to cross. We 
show that deciding on the existence of a drawing satisfying at least k 
constraints from a given set of non-crossing constraints is NP-complete 
even if the graph is 2-layered and even when the permutation of the 
vertices on one side of the bipartition is fixed. We also propose simple 
constant-ratio approximation algorithms for the optimization version of 
the problem and we discuss how to extend the well-known hierarchical 
approach for creating layered drawings of directed graphs with the capa- 
bility of minimizing the number of edge crossings while maximizing the 
number of satisfied non-crossing constraints. 



1 Introduction 

The problem of embedding graphs in the plane when only some pairs of edges 
are allowed to cross, known in literature as realizability, has been formally in- 
troduced in [12]. It finds application, e.g., in VLSI layout, where the crossings 
between certain pairs of edges must be avoided due to the physical realization 
of connectors, and in constrained graph layout, where users can specify require- 
ments on the visualization that should be fulfilled by the drawing algorithm. In 
these settings, it is fundamental to decide if an embedding of a graph satisfying a 
given set of non-crossing constraints exists, and, if the answer is positive, to find 
it. Realizability has been proved to be NP-hard in [10], but surprisingly it is not 
known to belong to the class NP, turning out to be a very interesting problem 
from a theoretical perspective. Its theoretical relevance is mostly due to the con- 
nection with the classical themes of planarity and crossing numbers and with the 
recognition of string graphs, i.e., intersection graphs of curves in the plane. The 
relation between realizability and string graphs is addressed in [10,11]; here we 
limit to point out that no finite algorithm for the recognition of string graphs is 
known and that, given a graph G, it is possible to build an instance of realizabil- 
ity which can be satisfied if and only if G is a string graph [11]. It is also worth 
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observing that realizability is a generalization of the well-known planarity prob- 
lem, which requires to decide if a graph is planar and clearly coincides with the 
special case where no pair of edges is allowed to cross. We recall that planarity 
can be checked in linear time [9]. 

This paper is concerned with the realizability problem in the special case 
of hierarchical drawings [3], i.e., drawings where vertices are constrained to lie 
on a set of parallel lines and edges are represented as polygonal chains. The 
hierarchical drawing convention is widely used to represent procedure call de- 
pendencies, class hierarchies, is-a relationships, and several economic and social 
structures [3,5,14]. In the remainder of the paper we will refer to the vertex set 
on each line as layer or level, and sometimes to the drawing itself as layered 
drawing. Without loss of generality we also assume to deal with proper layered 
graphs, i.e., layered graphs whose edges span only consecutive levels: it is always 
possible to reduce to this case by properly adding dummy vertices to split edges 
with end-points in non-consecutive layers. Under this hypothesis each edge in a 
hierarchical drawing is simply represented by a straight-line. The main question 
that we address is the following: Given a proper layered graph G, a set of edge 
pairs C , and an integer k > 0, does a hierarchical drawing of G exist s.t. at least 
k edge pairs in C do not cross? We refer to this problem as hierarchical realiz- 
ability (HR) and to the elements of C as non-crossing constraints. We study the 
computational complexity of some variants of hierarchical realizability and we 
describe approximation strategies for the optimization versions of these prob- 
lems, that require to find a maximum realizable subset of C. We also show how 
algorithms for the variants that we consider can be used as subroutines to boost 
existing graph drawing algorithms to support non-crossing constraints. Our re- 
sults can be easily generalized to constraints concerning the relative positions 
of vertices within each layer, though for simplicity we will not consider them in 
this paper. 

In more detail, the presentation of our results is organized as follows. In Sec- 
tion 2 we prove that deciding on the realizability in case of hierarchical drawings 
is not easier than deciding on the realizability for generic drawings: we show that 
HR, which clearly belongs to NP, remains NP-hard even if G is 2-layered and 
A: = jCj. It is interesting to observe that the strictly related problem of deciding 
if a layered graph has a hierarchical drawing without edge crossings can be solved 
in linear time [8]. From an optimization point of view, it is trivial to approxi- 
mate a maximum cardinality subset of realizable constraints with expected ratio 
2, yet we can exhibit instances for which this bound is tight. In addition to the 
general case, in Section 3 we consider a one-sided realizability problem where 
G is 2-layered and the vertex ordering of a layer is fixed; we show this problem 
to be NP-hard for a generic k, approximable with constant ratio 2, and poly- 
nomial if k = jCj. Based on the one-sided formulation, in Section 4 we finally 
address the constrained edge crossing minimization problem in layered drawings, 
i.e., the problem of drawing hierarchically a graph with the minimum number 
of edge crossing while respecting a set of non-crossing constraints. We consider 
the well-known Sugiyama’s approach for producing layered drawings of directed 




Layered Drawings of Graphs with Crossing Constraints 359 



graphs [14] and we extend it in order to deal with non-crossing constraints. In 
particular, we provide a mechanism for identifying a subset C" C (7 of realizable 
constraints, and then we show how to minimize edge crossings while satisfying 
all the constraints in C . 

2 Hierarchical Realizability 

In this section we prove that deciding about the hierarchical realizability of 
a proper layered graph under a given set of non-crossing constraints is NP- 
complete. From an optimization point of view, we show that a maximum cardi- 
nality subset of realizable constraints can be easily approximated with expected 
ratio 2 and we exhibit instances for which this bound is tight. 

Lemma 1. HR € NP. 

Proof. Since edge crossings in layered drawings depend only on the permutation 
of the vertices within each layer, a non-deterministic algorithm has simply to 
guess an ordering of the vertices of each layer and to check in polynomial time 
the realization of at least k constraints. 

In the following we show the NP-hardness of HR on two-layered (i.e., bipar- 
tite) graphs. For brevity, we call bipartite straight-line drawing a hierarchical 
drawing with only two layers. Let B = {Vq,Vi^E) be a bipartite graph with 
n = riQ + ni vertices and m edges. Without loss of generality, we assume that for 
any (a,b) € E a € Vq and b € Vi. We also denote by ttq and tti two permutations 
of the vertices in Vq and Vi, respectively. Let C C be a set of pairs of edges. 
The hierarchical realizability problem on bipartite graphs consists of asking for 
a set C C C, \C'\ > k, and for a bipartite straight-line drawing of B such that 
for any (a, 6; c, d) £ C edges (a, b) and (c, d) do not cross in the drawing. More 
formally, the problem can be stated as follows: 

Bipartite realizability (in short, BR): Let B = {Vq,Vi,E) be a bipartite graph, 
let C be a set of non-crossing constraints, and let fc be a positive integer. Do two 
permutations tto and tti exist such that the bipartite straight-line drawing of B 
induced by tto and tti satisfies at least k constraints? 

In the special case where the permutation of the vertices in a layer (e.g., Vq) 
is fixed, we have the one-sided bipartite realizability problem (in short, OBR), 
whose complexity and approximability are discussed in Section 3. 

Before proving the NP-completeness of BR, we state it in a slightly different 
form which is more convenient for our purposes. Indeed, it is easy to see that 
asking for two permutations satisfying a set of non-crossing constraints C is 
equivalent to asking for two bijective functions ttq and tti such that: 

\/{a,x;b,y) £ C' (7To(a) - 7ro(6)) • (7 Ti(x) - 7Ti(j/)) > 0 (1) 

Based on this observation, we prove the NP-hardness of BR by means of a 
polynomial time reduction from a total ordering problem arising in the computa- 
tional biology field, called betweenness, which has been shown to be NP-complete 
in [13] and is defined as follows: 
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Betweenness: Given a finite set S of real variables and a set of triples T C S^, 
does an ordering of the variables exist such that for each (a, b,c) G T variable b 
is positioned between variables a and c in the ordering? 

We refer to the triples in T as ordering constraints. A solution for betweenness 
is a bijective function / : S -G {l..|S'|} such that: 

\f{a,b,c)GT (/(a)-/(6)).(/(6)-/(c))>0 (2) 

We remark that, in spite of the apparent similarity of Equation 1 and Equation 2, 
the former involves four distinct vertices and two different functions, while the 
latter only three variables and a single function. A one-to-one correspondence is 
therefore not straightforward. 

Lemma 2. Bipartite realizability is NP-hard. 

Proof. Moving from an instance of betweenness, we build an instance of BR 
as follows. The bipartite graph B = {Vq,Vi,E) is such that: (a) Vq = S; (b) 
Vi contains a distinct vertex x' for each item x G S, i.e., Vi = {x' : x G S}; 
(c) E = {{x,x') : X G S'} U {{a,b'),{b,c') : (a,b,c) G T}. The set of non- 
crossing constraints C is the union of two disjoint sets Ci and C 2 , where Ci = 
{{x,x';y,y') : x,y G S, x ^ y} and C 2 = {{a,b';b,c') : (a, 6, c) G Tj. Note 
\^i\ = l'5'|(|S| — 1) and IC 2 I = |T|. k = IGil -I- IG 2 I, i.e., the problem is to decide 
if all the non-crossing constraints can be realized. 

It is easy to verify that the reduction above requires 0(|T| -I- |Sp) time. To 
prove that k constraints in C can be realized iff the instance of betweenness 
admits a solution we exploit the one-to-one correspondence between ordering 
constraints in T and crossing constraints in C 2 and the fact that realizing all 
the constraints in C\ forces 7To(a) = 7Ti(a') for any pair of corresponding vertices 
a G Vo and a' G Vi. 

Let us first suppose that two permutations ttq and tti exist able to satisfy all 
the constraints in C. Then / = ttq represents a solution for betweenness. Indeed, 
let (a, b,c,) G T he an ordering constraint in the instance of betweenness. Let us 
consider the constraints (6, 5';c, c') G Ci and {a,b';b,c') G C 2 : these constraints 
necessarily exist by construction. Since ttq and tti satisfy all the non-crossing 
constraints, by Equation 1 we know that: (7To(a) — 7 To(6)) • (7Ti(6') — 7Ti(c')) > 0 
and (no(b) — 7To(c)) • (7Ti(&') — 7Ti(c')) > 0 By multiplying the left members of 
these inequalities, we obtain (7To(a) — 7 To(6)) • (7To(b) — 7To(c)) > 0, proving that 
the ordering constraint (a, b,c) G T is satisfied by / = tto. 

Conversely, let us assume that the instance of betweenness admits a solution 
/ and let us define tto = / and Va' G Vi Tri(a') = f(a). Note that Va G S' 
7To(a) = 7Ti(a'). Any constraint {a,a';b,b') G Gi is trivially satisfied by ttq and 
7Ti, since 7To(a) — 7To(b) = 7Ti(o') — 7Ti( 6') and therefore their product is > 0. Let 
(a, b'; b, c') be any constraint in C 2 . As above, (7 To( 6)— 7ro(c))-(7ri(6')— 7 Ti(c')) > 0. 
Moreover, since ttq = / satisfies constraint (a, b, c) G T, by Equation 2 (7To(a) — 
7To(6))-(7ro(5)— 7To(c)) > 0. By multiplying the left members of the two inequalities 
we finally get (7To(a) — 7 To(6)) • (7Ti(&') — 7ri(c')) > 0, i.e., constraint (a, b'; b, c') is 
satisfied by ttq and tti, as well. 
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The NP-completeness of HR immediately follows from Lemma 1 and Lemma 2: 
Theorem 1. Hierarchical realizability is NP-complete. 

We now consider the problem of approximating a maximum set of realizable 
constraints. If we choose, for each layer of G, a random ordering of the vertices 
in that layer, we approximate a maximum set of realizable constraints (chosen 
from C) with expected ratio 2. 

If c = \C\ and Ci = (a,x;b,y), 1 < t < c, is a non-crossing constraint 
involving vertices from any two consecutive layers Lj and Lj^i, four orderings 
among a,b € Lj and x,y € Lj+i are possible, yet only two of them realize c^. 
Hence, the probability Pi of having constraint Cj satisfied is equal to Let Xi be 
a random variable equal to 1 if constraint is realized, 0 otherwise. The quantity 
represents the number of realized constraints and the expected value 
for it is then equal to ^i] = Yh=i E[Xi] = = f . 

We now show that the ratio above is tight, i.e., no algorithm can do better 
than trying to satisfy half of the constraints in C, unless being able to recognize 
non-realizable instances of HR. The following lemma is useful at this aim: 



Lemma 3. The number of non-crossing pairs of non-incident edges in any bi- 



partite straight-line drawing of is equal to ^ 



n(n— 1) 



Proof. Any bipartite straight-line drawing of has the same number x of 

non-crossing pairs of non-incident edges. Then: 

^ = 5 E”=1 (E.<. Yh<, 1 + Yh>, i) = (^) ' 



Let G = Kn^n and let C contain a constraint for any pair of non-incident 
edges. It is easy to see that c = \G\ = — — and therefore, due to Lemma 3, 

the number of realized constraints is always equal to | independently of the 
permutation of the two layers. 



3 One-Sided Bipartite Realizability 

In this section we focus on the one-sided bipartite realizability problem, that 
turns out to be very useful in the design of algorithms for solving HR, as we 
will show in Section 4. Unfortunately, we can prove that it is NP-complete, yet 
easier than BR, being polynomially solvable if we are interested in the existence 
of a solution satisfying all the constraints in G (i.e., for k = IC'D. 

NP-completeness. For simplicity we prove the NP-hardness of one-sided bi- 
partite realizability in two steps, making use of a simple generalization of this 
problem which takes into account also crossing constraints in addition to the 
non-crossing ones. We refer to the generalized problem as one-sided bipartite 
realizability with mixed constraints (in short, MBR). MBR is stated exactly as 
OBR, except for taking into account a set of crossing constraints / C with 
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the following meaning: for any (o, b; c, d) G I, edges (a, 6) and (c, d) should cross 
in the bipartite drawing of graph B. 

With arguments similar to those used in the proof of Lemma 1, it is easy 
to see that both OBR and MBR belong to NP. We now prove the NP-hardness 
of MBR by means of a polynomial-time reduction from the acyclic subgraph 
problem. Then we further reduce MBR to our original realizability problem 
OBR. We recall that the acyclic subgraph problem is stated as follows: 

Acyclic subgraph (in short, AS): given a directed graph G = (V, A) and a positive 
integer h, does a set A' C A exist such that |A'| > h and the graph G' = (P, A') 
is acyclic? 

This problem has been proved to be NP-complete in [6] . 

Lemma 4. One-sided bipartite realizability with mixed constraints is NP-hard. 

Proof. Given an instance of AS, we build an instance of MBR as follows. The 
bipartite graph B = (Vo,Vi,E) is such that: (a) Vi = V] (b) Vq contains a 
distinct vertex x' for each vertex x G V, i.e., Vq = {x' s.t. x GV}; (c) V(a;, y) G A 
both (x', y) and {y' , x) belong to E. Permutation tto is any ordering of the vertices 
in Vo- For each arc (x, y) G Awe set up a constraint (x', y; y' , x): if 7 To(x') < 7To(y') 
we add (x',y;y',x) to I, otherwise we add (x',y;y',x) to C. k = h. Figure 1 
provides an example of this reduction. A directed graph G is given in Figure 1(a) 
and the bipartite graph B obtained from it is shown in Figure 1(b), together 
with the whole set of constraints obtained choosing ttq = {d', a',c', b'). 





O) 



7to=<d',a’,c',b’> 
C|=(a’,b;b’,a) e I 
C2=(a’,c;c’,a) e I 
C3=(b’,d;d’,b) e C 
C4=(c’,d;d’,c) e C 
C5=(d’,a;a’,d) e I 




Fig. 1. From acyclic subgraph to one-sided realizability with mixed constraints. 



It is worth observing that there is a one-to-one correspondence between con- 
straints and arcs of the digraph and that the constraint (x', y, y' , x) associated to 
arc (x, y) belongs to C if and only if 7 To(x') > 7To(y'). To prove the correspondence 
between solutions of AS and solutions of MBR we now exploit a basic idea: any 
permutation of the vertices of a digraph partitions its arcs into left-to-right and 
right-to-left arcs. We recall that a topological sort of an acyclic digraph yields 
no right-to-left arc. Based on this idea. Figure 2 points out the meaning of both 
crossing and non-crossing constraints: if arc {x' ,y') is left-to-right according to 
7To, then we force x and y to have the same ordering as x' and y' using a crossing 
constraint (see Figure 2(a)); if arc {x',y') is right-to-left according to ttq, then 
we force x and y to have the opposite ordering of x' and y' using a non-crossing 
constraint (see Figure 2(b)). 
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Fig. 2. Crossing and non-crossing constraints: (a) {x',y;y',x) G /; (b) 
{x',y;y',x) £ C. In both cases arc {x,y) is forced to be a left-to-right arc. 



Roughly speaking, our constraints force the presence of as many left-to-right 
arcs as possible. Proving that any permutation tti realizes k constraints from 
/ U (7 iff it yields an acyclic subgraph of G containing h = k arcs is now easy. 
Let c = {x',y,y',x) be a constraint from / U C. If c G / (i.e., 7 To(x') < Tfoiu')), 
then c is realized by tti if and only if 7ri(a;) < TTi{y). Analogously, if c G C 
(i.e., TTo(x') > 7To(t/')), then c is realized by tti if and only if tti(x) < 7Ti(y). In 
both cases the satisfaction of constraint c implies that arc (x, y) is a left-to-right 
arc according to tti, and vice-versa. Therefore, any permutation tti satisfies a 
number of constraints equal to the number of left-to-right arcs that it induces. 

In the example in Figure 1, choosing e.g. tti = (a,6, c, d) partitions the arcs 
of G as in Figure 1(c) and yields the bipartite straight-line drawing of B shown 
in Figure 1(d). In this drawing only constraint (d',a;a\d) G / is not satisfied; 
note this constraint corresponds to the unique dotted arc (d, a) G A \ A'. 

Theorem 2. One-sided bipartite realizability is NP-hard. 

Proof. Let us consider an instance of MBR specified by a bipartite graph B = 
(Vo,Vi,E), a set of crossing and non-crossing constraints I U G, a permutation 
ttq, and an integer k (w.l.o.g. in the following we assume that 7To(a) < 7To(b) for 
any constraint {a,x]b,y) G dU G). We build a corresponding instance of OBR 
specified by a bipartite graph B' = {Vq, V(, E'), a set of non-crossing constraints 
C , a permutation ttq, and an integer k' as follows. 

For each crossing constraint (a, x; b,y) G / we create a vertex named ab, an 
edge (ab,y), and a non-crossing constraint {ab,y, a,x). We denote the sets of all 
the new vertices, the new edges, and the new constraints with Vq, E, and G, 
respectively. We then set Vf = VqUVq, V( = Vi, E' = E U E, C = C U C, and 
k' = k. At last, we choose any permutation ttq such that all vertices in Vq are 
to the left of vertices in Vq (their relative positions are not important for our 
purposes), while the vertices in Vq retain the same ordering as in ttq. 

In the following we show that any permutation tti satisfies the same number 
of constraints in the instance of MBR and in the corresponding instance of OBR, 
proving a one-to-one correspondence between the solutions of the two problems. 
It is obvious that any permutation tti satisfies the same constraints from G in 
both instances. Moreover, there is a one-to-one correspondence between con- 
straints in I and constraints in G: let (a,x;b,y) G / and let (ab,y;a,x) G G 
be the corresponding non-crossing constraint. Recall that by construction we 
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have 7Tg(a6) < 7TQ(a) < 7Tq( 5). Therefore, the crossing constraint {a,x]b,y) is 
realized by tti if and only if the non-crossing constraint {ab,y;a,x) is, proving 
that each satisfied constraint from I corresponds to a satisfied constraint from 
C and vice-versa. 

Approximability and polynomiality. In order to prove that OBR is polyno- 
mial if A: = \C\ and that a maximum set of realizable constraints can be approxi- 
mated within a constant ratio we use an approximation preserving reduction [1] 
to the maximum acyclic subgraph problem. This problem is the optimization 
version of AS: given a digraph H = (N,A,w) with positive arc weights w, it 
requires to find a maximum weight set of arcs A' C A such that the subgraph 
H' = {N, A') is acyclic. 

Let B = {Vo,V\,E) and tto specify an instance of one-sided bipartite realiz- 
ability. W.l.o.g. we assume that for each {a,x]b,y) G C it holds 7To(a) < 7 To(6); 
otherwise we change C by removing (a, x] b, y) and adding the analogous con- 
straint (6, y; a,x). In the reduction we build a weighted constraint digraph H = 
(N,A,w) as follows: N = Vi; A = {{x,y) : 3 a,b G Vq s.t. (a,x;b,y) G C}; 
V(x, y) G A, w{x, y) is the number of constraints of C containing x and y in the 
second and fourth positions, respectively. In other words, the weight of arc (x, y) 
is the number of constraints that require tti{x) < TTi{y) in order to be realized. 

An almost trivial observation is that H is acyclic iff all the constraints in C 
are realizable. This means that OBR can be solved in linear time if fc = \C\, 
thanks to the fact that both the previous reduction and checking the acyclicity 
of the constraint digraph can be accomplished in linear time. 

The following property also holds: each A' C A such that the subgraph 
H' = {N, A') is acyclic corresponds to a realizable set of constraints C' C C 
with \C'\ > w{A'). Indeed, if tti is a topological sort of H' , then tti satisfies at 
least all the constraints corresponding by construction to the arcs in A', whose 
quantity is w(A') = y)evi' could happen that tti satisfies more 
constraints if A' is not maximal). Finally, it is easy to see that the weight of 
a maximum acyclic subgraph of the constraint digraph equals the maximum 
number of realizable constraints, i.e., w{A*) = IC*!. 

Let r > 1 be any approximation ratio for the maximum acyclic subgraph 
problem. From the previous considerations we have: 

\C'\>w{A')>l-w(A*)=l-\C*\ 

proving that C' is an r-approximate solution for one-sided bipartite realizability, 
as well. Hence, a maximum set of realizable constraints can be approximated with 
ratio 2 [7]. 

4 Constrained Crossing Minimization 

A widely used approach for creating layered drawings of directed graphs is pre- 
sented in [14] and is known in literature as hierarchical approach. In order to 
produce readable drawings, different aesthetic criteria are taken into account by 
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this method; among them the number of edge crossings deserves special atten- 
tion. In this section we show how to extend the hierarchical approach in order 
to support non-crossing constraints. Our extension hinges upon the approxima- 
tion algorithm for one-sided bipartite realizability presented in Section 3. We 
first briefly recall how Sugiyama’s algorithm works. Given a directed graph G, 
it performs four main steps. (1) Cycle removal: the direction of some arcs is 
temporarily reversed to make G acyclic. (2) Layer assignment: each vertex is 
assigned with a level so that all the arcs “flow” in the same direction. Dummy 
vertices are added for arcs spanning more than two levels. (3) Crossing reduc- 
tion: vertices in each level are ordered so as to minimize the total number of edge 
crossings. (4) Coordinates assignment: vertical coordinates are proportional to 
layers, horizontal coordinates depend on the permutation of the vertices within 
each layer. At this point the direction of the reversed arcs is also restored. 

In order to extend the hierarchical approach to support non-crossing con- 
straints we need to focus on steps 2 and 3, because neither the removal of cycles 
nor the assignment of coordinates are affected by this kind of constraints. In the 
following we denote with G the directed graph to be drawn and with G the set 
of non-crossing constraints to be realized in the drawing. 



Extending step 2. We recall that in this step the directed graph G is trans- 
formed into a proper layered graph. The set of constraints G must be therefore 
changed accordingly, due to the fact that dummy vertices could be added to 
split an edge involved in some constraint. We let the algorithm assign layers 
to vertices as usual, where l{v) denotes the level of a vertex v after the layer 
assignment. Then we change G as follows. 

Let (a, x) and (&, y) be any two arcs of G such that constraint (a, x; b, y) G G; 

w. l.o.g. we can assume l{a) < l{x) and l{b) < l{y). If l{x) = l{a) I and l{y) = 

Z(6)-|-l we do not touch G. Otherwise, at least a dummy vertex has been added to 
split (a,x) or (b,y) or both and constraint (a,x; b,y) must be suitably replaced. 
Let {uk, Uk+i, • • ■ , Uk+s) be the path that replaced arc (a, x), with Uk = a, Uk+s = 
X, k = l{a), and k s = l{x). Analogously, let {vh,Vh+i, ■■■,Vh+t) be the path 
that replaced arc (6, y), with Vh = b, Vh+t = y, h = l{b), and h-\-t = l{y). Let also 
[r, r-\-q\ = [fc, /c-|-s]n[/i, h-\-t]. If [r, r-\-q] yf 0 we replace constraint (a, x; b, y) with 
the constraints (u^, u^+i; u^+i) . . . {ur+q-i,Ur+q;Vr+q-i,Vr+q). Otherwise we 

remove it, since it is going to be satisfied by any drawing obtained from layer 
assignment 1. Figure 3 shows an example of the two cases. It is easy to get 
convinced that an original constraint will be realized in the final drawing if and 
only if all the new constraints are satisfied. 

Thanks to the preprocessing of set G described above, we can now assume 
to deal only with constraints involving edges that join vertices in consecutive 
layers. More formally, we assume that G has k layers, named LQ...Lk, and 
that each {a,x]b,y) G C is such that 3i G [0..k — 1] such that a, & G Li and 

x, y € Li+i. Based on this assumption, we can partition G into k—1 sets, named 
Cop . . . Cfc_ip, where Cjp+i contains the constraints related to edges with end- 
points in Li and Li+i. 
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(a) (b) 



Fig. 3. Constraints replacement during the layer assignment step: dummy ver- 
tices are squared, (a) {a,x; b,y) is replaced by constraints between edges in the 
grey band; (b) the grey band is empty: (a, x; b, y) is removed from C. 



Extending step 3. Before attempting at minimizing edge crossings, we in- 
troduce a constraint realization step, aimed at finding a large set of realizable 
constraints. Then we devise a crossing minimization strategy able to take into 
account and satisfy the set of realizable constraints previously identified. 

We recall that the most used strategy for crossing minimization in hierar- 
chical drawings is based on a layer-by-layer sweep heuristic [3], that requires 
to repeatedly solve a crossing minimization problem on a bipartite graph with 
one side fixed. We therefore adapt the layer-by-layer sweep method to support 
non-crossing constraints, by repeatedly solving a one-sided bipartite realizability 
problem as follows. 

A vertex ordering ttq for Lq is first randomly chosen. Then, for i = 0 to 
k — 1, the bipartite subgraph Bi of G induced by the vertices in Li U Ti+i and 
the set of constraints are considered. An approximated solution to OBR 

on this instance is found, keeping layer Li fixed. This leads to identify a set 
C'i j_|_i C Ci^i+i of realizable constraints. We have then to find a vertex ordering 
for Li+i which minimizes edge crossings while realizing the constraints of C' i_^_i- 
Not all crossing minimization algorithms can be easily adapted to accomplish 
this task (think, e.g., of the well-known median/barycenter algorithms [4,14]); 
here we suggest a modification of the penalty minimization technique discussed 
in [2]. 

With this method, a penalty digraph P (with vertex set Li+i) is built from Bi 
and TTi, a feedback arc set F is found on P, and a topological sort of the vertices 
in the subgraph of P obtained by deleting arcs in F is returned as ordering for 
Li^i. We recall that a feedback arc set of a directed graph is a subset of arcs 
whose removal makes the digraph acyclic; the feedback arc set problem requires 
to find a minimum weight feedback arc set. 

Crossing constraints can be incorporated in the penalty based approach by 
suitably assigning weights to some arcs of P and by applying the previous al- 
gorithm as is. Namely, if (a,x;b,y) is a constraint in C'a_^i with a,b € Li and 
x,y € Tj+i, we add arc (x,y) to P if it does not already exist, and we set 
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w{x,y) = oo. Since is realizable, the subgraph of P induced by the infi- 

nite weight arcs is acyclic. Any approximation algorithm for the feedback arc set 
problem applied on P will therefore choose none of these arcs, thus minimizing 
crossings while satisfying all the constraints in 

As far as the algorithm above is concerned, it may happen that a pair of edges 
decomposed by means of dummy vertices may cross several times. This can be 
avoided by assigning oo weight to suitably chosen arcs in the constraint digraph. 
We also remark that our algorithm can be extended to deal with constraints 
concerning the relative positions of vertices within each layer. Due to the lack 
of space, we defer the details to the full paper. 
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Abstract. Hierarchical decompositions are a useful tool for drawing 
large graphs. Such decompositions can be represented by means of a data 
structure called hierarchy tree. In this paper we introduce the notion of 
V-validity of hierarchy trees with respect to a given property "P: this 
notion reflects the similarity between the topological structure of the 
original graph and of any high-level representation of it obtained from 
the hierarchy tree. We study the V-validity when the clustered graph is 
a tree and property V is the acyclicity, presenting a structure theorem 
for the V-validity of hierarchy trees under these hypotheses. 



1 Introduction and Preliminaries 

Many graph drawing algorithms have been designed during the last years [2] and 
most of them move from the following assumption: in order to find a pleasant 
layout of a graph, we first of all should be able to recognize its graph-theoretical 
properties, e.g., acyclicity, planarity, bipartiteness, and so on. Actually, exploit- 
ing the relevant features of a graph helps produce better visualizations for it. It 
is also well accepted that only few graph drawing algorithms scale up well and 
are able to visualize the more and more large graphs that arise in practical appli- 
cations. Clustering techniques are a useful tool to overcome this drawback and 
make it possible dealing with graphs not fitting in the screen [3,4, 5, 6]. Recur- 
sively clustering the vertices of a graph leads to a hierarchical decomposition of 
the graph itself: each cluster in the decomposition is considered as a single node 
which is visualized instead of a set of vertices of the original graph, consider- 
ably reducing the dimension of the drawing. Thanks to the parent relationships 
between clusters in the hierarchical decomposition, the viewer can move from a 
high-level representation of the graph to another one by detailing or shrinking 
clusters. The considerations above immediately lead to the following problem: 
Given a hierarchical decomposition of a graph G and a graph-theoretical property 
V satisfied by G, does any high-level representation of G obtained from the hier- 
archical decomposition satisfy property V ? Though this question naturally arises 
when using hierarchical decompositions for visualizing large graphs, as far as we 
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know it has not been previously studied in literature. Some related work is due 
to Feng et al, who focus on clustered planar graphs and investigate the concept 
of compound planarity [5]. 

In this paper we introduce the general notion of V-validity of hierarchical de- 
compositions of graphs with respect to a given property V . This notion reflects 
the similarity between the topological structure of the original graph and of any 
high-level representation of it and allows us to compare different hierarchical 
decompositions associated to the same graph. We then focus on the case where 
the clustered graph is a tree and property V is the acyclicity and we present a 
structural characterization of the V-validity under these hypotheses, i.e., condi- 
tions on the structure of the clusters necessary and sufficient to guarantee the 
V-validity of the decomposition. 

We start with preliminary definitions and notation. A hierarchical decompo- 
sition of a graph can be represented by means of a data structure called hierarchy 
tree (or sometimes cluster or inclusion tree) [1,5]. A hierarchy tree HT = (N,A) 
associated to a graph G = (V, E) is a rooted tree whose leaves are the vertices of 
G. Nodes of a hierarchy tree are called clusters: a cluster c represents a set Vc of 
vertices of G, namely, the vertices that are the leaves of the subtree rooted at c. 
We say that such vertices are covered by c and we refer to their number as cardi- 
nality of c. For brevity, we write u ^ do indicate that a vertex u G V is covered 
by a cluster c G N. We call singleton cluster a cluster with cardinality equal to 
1. W.l.o.g. we assume that the vertices covered by a cluster c are a proper subset 
of the vertices covered by the parent of c in HT, i.e., in the hierarchy tree there 
is no node with a unique child. Under this hypothesis, the number of nodes of 
a hierarchy associated to a n- vertex graph is at most 2n — 1. This implies that 
adding a hierarchy tree on the top of a graph requires only space linear in the 
number of vertices of the graph itself. 

It is possible to visit a hierarchy tree in a top-down fashion by performing 
expand and contract operations on the clusters visualized at any instant of time: 
these operations define a navigation of HT. The previous ideas can be formalized 
by defining the concepts of covering and of view [1,6]: 

Definition 1. A covering G of a graph G = {V, E) on a hierarchy tree HT = 
{N, A) is a subset of the nodes of HT such that each vertex of G is covered by 
exactly a node in G . 

A view of graph G on HT is a graph W = (G, L) such that C is a covering of 
G on HT and L = {(c, c') | c, c' G G, c yf c', 3(w,u) G E : u <c and v -< c'}. 

We refer to the edges of W as links. It is worth observing that any view is a 
simple graph: if there exist two edges {u, v) G E and {u' , v') G E which lead to a 
connection between two clusters c and d , only a single link (c, c') is considered in 
L. We also call HA and Wi = G the views generated by the root of the hierarchy 
tree and by all its leaves, respectively: in other words, Wr and Wi are the least 
and the most detailed representations of the graph, respectively. 
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2 P- Validity of Views and Hierarchy Trees 

A navigation of a hierarchy tree should help the viewer to focus on particular 
interesting regions of the graph, according to his/her necessities; hence, if the 
clusters are generated not taking into account the topological structure of the 
graph, no benefit may derive for the viewer from the clustering structure. In- 
formally speaking, only vertices in the same “locality” of the graph should be 
grouped together to form a cluster. 

A motivating example of the concept of V-validity is illustrated in Figure 1, 
where two different hierarchical decompositions associated to a chain with 8 
vertices are considered (Figure 1(a) and Figure 1(d)). The hierarchy trees HTi 
(Figure 1(b)) and HT 2 (Figure 1(e)) corresponding to these decompositions are 
both complete binary trees of height 3 and differ only in the permutation of 
their leaves. Figure 1(c) and Figure 1(f) report three views related to coverings 
{E, F}, {E, C,D}, and {A,B,C,D} in the two decompositions, respectively. All the 
views built from HTi maintain the structural property of the original graph to 
be a chain, while the views from HT 2 loose this property introducing cycles, up 
to become even a clique. 
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Fig. 1. Hierarchical decompositions, hierarchy trees, and views of a 8-vertex 
chain 



In order to characterize the “semantic” differences of hierarchy trees with 
“syntactically” similar or even identical structure it is then natural to introduce 
a notion of validity of a view and of a hierarchy tree with respect to a certain 
property V: from now on we refer to this concept as V-validity. 

Definition 2. Let HT be a hierarchy tree associated to a graph G and let V he 
a property satisfied by G. A view W of G obtained from HT is V-valid iff W 
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satisfies property V . HT is V -valid iff all the views of G obtained from it are 
V-valid. 

In view of the fact that Definition 2 is parametric in the property V to be 
considered, different notions of validity may be thought for different classes of 
graphs. For example, we may require a view of a bipartite graph to be bipartite or 
a view of a planar graph to be planar. But even more sophisticated definitions 
for V can be considered: as an example, we refer to the c-planarity property 
introduced by Feng et al. [5]. The interest in characterizing P-valid hierarchy 
trees naturally follows from the considerations in Section 1 . In this paper we focus 
on the case where the clustered graph is a tree and property V is the acyclicity: 
for brevity, we will speak of valid views/hierarchy trees under these hypotheses. 
In Section 3 we prove necessary and sufficient conditions for a hierarchy tree to 
be valid. 

3 A Structural Characterization of Valid Hierarchy Trees 



Before presenting the structure theorem for the validity of a hierarchy tree, we 
give preliminary definitions and lemmas useful for proving it. First of all we study 
the connectivity of views obtained from hierarchy trees of connected graphs. 

Lemma 1. Each view obtained from a hierarchy tree associated to a connected 
graph is connected. 

In the following we denote with S'(c) the subgraph of T = (V, E) induced by 
the vertices covered by a node c of HT. For each u,v GT, let pathriu, v) be the 
path joining u and u in T and let distriu, v) be the length of this path. 

Definition 3. Let c be a node of a hierarchy tree HT associated to a free tree 
T. Let u and v be two vertices ofT covered by c. u, v are a broken pair of cluster 
c iff they are neither coincident nor connected in S{c). A broken pair u,v is 
a minimum-distance broken pair of c iff w yi. c, \/w G pathxiu^v) such that 
w u,v. 

Lemma 2. Let W be a view on a hierarchy tree HT associated to a free tree T. 
If W is not acyclic, then in each cycle C there is at least a cluster containing a 
broken pair. 

Proof. Let C = (ci,...,Ch) be a cycle in W. Each cluster Ci G C is endpoint 
of two links (cj_i,Ci) and (cj,Cj+i), respectively (to simplify the notation we 
assume that ft- + 1 = 1). Let us call ft) and (r*, ft+i) two tree edges which 
derive links (ci_i,Cj) and (ci,Ci+i), respectively. For each cluster Cj we therefore 
identify two vertices covered by it, named f and (see Figure 2). If no broken 
pair exists in any cluster of C, for each i G [l,ft] vertices f and rt are either 
coincident or connected in S{ci). This implies the existence of a cycle in tree T, 
that is a contradiction. 
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Fig. 2. Vertices, edges, clusters, and links involved in cycle C in the proof of 
Lemma 2 



Theorem 1. Let T = (V,E) be a free tree and let HT = (N,A) be a hierarchy 
tree associated to T. HT is valid iff for each minimum- distance broken pair u,v 
of HT distriuju) = 2. 

Proof. We first prove the necessary condition, showing that the existence in HT 
of a minimum-distance broken pair u,v such that distT{u,v) > 2 implies that 
HT is not valid. Let c be a node of HT such that u, v are a minimum-distance 
broken pair in c. Let us consider the view W = {C, L) where covering C consists 
only of singleton clusters, except for cluster c: C = {c}U{{a:}, x GV and x 7 ^ c}. 
Let z and w be the vertices in pathxiu^v) such that (u,z) G E and (w,v) G E. 
Since u,v is a minimum-distance broken pair of c, z 7 ^ c, w 7 ^ c, and any 
other vertex in pathT{u,v) is not covered by c. Moreover, since distriuju) > 2, 
distriz, w) > 0, i.e., z and w are distinct vertices. Hence (c, {z}, . . . , {w}, c) is a 
cycle in W, proving that HT is not valid. 

We now focus on the sufficient condition, proving that a contradiction can 
be derived if we suppose that HT is not valid while satisfying the property on 
the minimum-distance broken pairs formulated in the statement of the theorem. 
Namely, we assume that in HT there exists a non-valid view W = {C,L), i.e., a 
view W that is not a tree. Since W must be connected due to Lemma 1, it must 
contain a simple cycle C. We prove that this assumption leads to a contradiction. 

The general idea of the proof is to convert W into another view W', still 
existing on the hierarchy tree, such that: (a) the number of singletons of W is 
strictly greater than the number of singletons of W ; (b) the cycle C of W is also 
changed into a simple cycle C in W' . The sequence of manipulations that we 
perform has finite length, as the number of singletons is clearly upper bounded 
by n = \V\. We can therefore prove that at some step during this process we find 
a contradiction due to Lemma 2, since we obtain a cycle with no broken pairs. 

Let C = (ci, . . . , Ch) be a simple cycle of length hinW and let Ci, k, and 
be defined as in the proof of Lemma 2 for 1 < i < h (see also Figure 2). Due 
to Lemma 2 a broken pair must exist in C. Let U and r* be the vertices in such 
a broken pair of C. We remark that h and are not necessarily a minimum- 
distance broken pair. 

Let us consider the path from h to in T, which is unique and not completely 
contained in S{ci). On this path we can univocally identify a set of k vertices 
Uj, for 1 < j < fc, such that Uj -< Ci but its successor on pathrih^ri) is not 
covered by Cj. Analogously, we can univocally identify a set of k vertices Vj, for 
1 < J < fc) such that Vj -< Ci but its predecessor on pathT{k,ri) is not covered 
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Fig. 3. (a) Broken pair li,ri in cycle C; (b) cycle C in the proof of Theorem 1 



by Cj. Observe that it could be Uj = Vj for some j, but vertices with different 
indexes always belong to different connected components of S'(cj). 

It is clear that Vi = It and Uk = Ti. Besides, it is easy to see that the pairs 
Uj,Vj+i, for 1 < j < k — 1, are minimum distance broken pairs of cluster Ci 
and, by hypothesis, we know that distT{uj,Vj+i) = 2. Let us call Zj the unique 
vertex of T in the path between Uj and vj+i and let c' be the cluster of W that 
covers Zj. The configuration is illustrated in Figure 3(a). 

We now change view W into W by expanding cluster Cj at the singleton 
level, i.e., by substituting with the set of singleton clusters corresponding to 
the vertices in S{ci). In the following we prove that we are able to exhibit in W' 
a simple cycle C . Let us first consider the cycle C shown in Figure 3(b). C clearly 
exists in kF', being obtained from C by unrolling pathT{h,ri) (compare vertices 
and clusters involved in Figure 3(a) and in Figure 3(b), respectively). However, 
C is not necessarily simple. W.l.o.g. we can assume that c' ^ c(, Vj, s G [1, fc — 1], 
j yf s. We can always reduce to this situation as follows: while C contains a pair 
j, s such that l<j<s<A:— 1 and c' = c^, substitute the path (c' , . . . , c(.) 
with the path (c'). After this operation all the c' are distinct. If Vj G [l,k— I] 
c'j ^ C, C' = C is a, simple cycle. Otherwise 3j G [l,k — 1] such that c' G C and 
two cases may happen: 

— c'l = Ci-i'. change C by replacing the non-simple path (ci_i,pi, c^,P 2 ) with 
the simple path (ci_i,p 2 )- If fc = 1 or Vj G [2, fc — I] c' ^ C, then the 
modified C is a simple cycle and we can choose C = C. Otherwise, let s 
be the smallest value in [2, k — 1] such that c(, G C, i.e., c(, = Ct for some 
t G [l,/i], t i. Moving clockwise on C, we find the simple cycle C = 
{ci-i,p 2 , c' 2 , ■ ■ ■ ,Ps,c'g = Ct ^ Ci_i), where ~ indicates a subpath of p. 

— c( Ci-i: let s be the smallest value in [1, k— 1] such that c(, G C, i.e., = c* 

for some t G [1, h], t ^ i. If s = 1 then C = (ci_i,pi, c'l = Ct ^ Ci_i) is a sim- 
ple cycle in IF'. Note that the length of the path (c* ~ Ci_i) is > 1, because 
Ct = c[ y^ Ci_i. If s>l then let C' = (ci_i,pi, c'i,p 2 , 4, . . . ,Ps,c^ = Ct ~ Ci_i). 
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In this case it could be Ct = Ci-i; however, the fact that the path from Ci_i 
to c'g has length > 3 guarantees C' to be a cycle of W . By construction we 
also know that C is simple. 

In any case we are therefore able to find a view W' containing more singletons 
than W and to exhibit a simple cycle C in W' . Iterating this reasoning, we obtain 
either a cycle with no broken pairs, which is absurd due to Lemma 2, or a cyclic 
view containing only singleton clusters, i.e., Wi. This is also a contradiction 
because Wi is acyclic being equal to T. 

Theorem 1 implies that the validity of a hierarchy tree can be checked in poly- 
nomial time. We remark that this does not immediately follow from Definition 2, 
since the number of views in a hierarchy tree may be exponential. 
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Abstract. A three- valued function / defined on the vertex set of a graph 
G = (V,E), / : U — >■ { — 1,0, 1} is a minus dominating function if the sum 
of its function values over any closed neighborhood is at least one. That 
is, for every v G V, /(A[u]) > 1, where N[v] consists of v and all vertices 
adjacent to v. The weight of a minus function is f{V) = X/ugy 
The minus domination number of a graph G, denoted by 7 “(G), equals 
the minimum weight of a minus dominating function of G. In this paper, 
sharp lower bounds on minus domination of a bipartite graph are given. 
Thus, we prove a conjecture proposed by J. Dunbar etc. (Discrete Math. 
199(1999) 35-47), and we give a lower bound on 'yks(G) of a graph G. 



1 Introduction 

For a graph G = (U, E) with vertex set V and edge set E, the open neighborhood 
of u G U is N{v) = {u G V\uv G E} and the closed neighborhood of v is 
A[w] = {ri} U N{v). For a set S of vertices, we define the open neighborhood 
N{S) = Uvi=sN{v), and the closed neighborhood N[S] = N{S)US. A dominating 
set S for a graph G = (V, E) is a subset of the vertex set V such that every vertex 
u G U is either in S or adjacent to a vertex in S. The domination number of G, 
7 (G), equals the minimum cardinality of a dominating set. 

For a real function / defined on vertices of a graph G and S Q V, write 
f{^) = SijgS /('^) ^ minus dominating function of G is 

defined in [3] as a function / : V -G {—1,0,1} such that f[v] > 1 for each 
V G V. A signed dominating function of G is defined in [5] as / : U — >■ (—1, 1} 
satisfying f[v] > 1 for all v G V . A minus(signed) dominating function / is 
minimal if every minus(signed) dominating function g satisfying g(y) < f(y) 
for every v G U, is equal to /. It is easy to see that a minus dominating 
function is minimal if and only if for every vertex v G V with f(y) > 0, 
there exists a vertex u G A[u] with f[u] = 1 and a signed function is min- 
imal if and only if every vertex v of weight 1, there exists some u G TV)?;] 
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such that f[u] = 1 or 2. The minus domination number for a graph G is 
7 “(G) = min{/(t7)| / is a minimal minus dominating function}. Likewise the 
signed domination number for a graph G is 7 s (G) = min{/(y)| / is a minimal 
signed dominating function} 

A majority dominating function of G is define in [2] as / : K — >■ {—1, 1} such 
that f[v] > 1 for at least half the vertices of G, and the minimum weight of such 
a function is the majority domination number. 

For a positive integer k, a k-subdominating function(kSF) of G is a function 
/ : y — >■ {—1, 1} such that f[v] = /(“) — ^ least k vertices of G. 

The aggregate ag(f) of such a function is defined by ag{f) = /(^) 

k-subdomination number yks{G) by jks = iTiin{ag{f) : / is a fcSF of G}. In 
the special cases k = \V\ and k = \ , yks is respectively the signed domination 
number 7 s (G) and the majority domination number ymaj(G). 

Since the problems of determining the signed domination number and minus 
domination number are NP-complete, many works on bounds for y~{G) and 
7 s (G) were studied in [4, 7, 8, 9, 10, 11, 12]. In [3], the following conjecture was 
given. 

Conjecture 1 [3]. If G is a bipartite graph of order n, then 7 “(G) > 4(\/n + 1 — 
1) — n. 

2 Lower Bounds on Minus Domination 
of a Bipartite Graph 

Theorem 1. If G = (A, Y) is a bipartite graph, then 7 “(G) > 4(-\/n + 1 — 1) — n 

Proof. Let / be a minus dominating function of G satisfying f{V) = 7 “(G) 
and 



M = {v € V\ f{v) = —1} 

P={vGV\ f{v) = l} 

Q = {v€V\ f(v)=0} 

Mx = Mnx, My = MnY, Px = PnX, Py = PnY, Qx = QnX, Qy = 
Qr\Y, ma: = \Mx\, my = |My|, Px = \Px\, Py = \Py\, Qx = \Qx\, Qy = \Qy\- 
Since f[v] > 1 for every v G V, we have \N{v) fl Px\ > 2 for every v € My- 
So 

6(Px,My) > 2my. (1) 

For every v G Px, |-^(^^) HMy] < \N{v) flPyj. Then 

e{Px,MY)= \N{v)nMY\ 
vePx 

< ^ |iV(u)nPy| 
vePx 

< PxPy 



( 2 ) 
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By (1) and (2) we have 



2my < PxPy 



Similarly 

2mx < PxPy, 

then 

rrix + my <PxPy (3) 

Since 



n = Qx + Qy + rrix + my + px + Py, 



and 

‘^y/PxPy <Px+ Py, 

we have 



‘^^/PxPy + mx + my + qx + Qy < n. (4) 

Using (3) and (4) we have 

2^Jmx + my + mx + my + Qx + Qy < n, (5) 

and 

2^\/mx + my + mx + my < n. (6) 

Using (5) we obtain 

7-(G) = /(U(G)) 

= n-{qx + Qy) - 2{mx + my) 

> mx + my + 2^Jmx + my — 2{mx + my) 

= 2^mx + my - {mx + my). (7) 

For notation convenience, we define the following 

a = y^mx + my, 
h(y) = y^ + 2y (y>l), 

g(y) = 2y-y^ (y>l). 

Since ^ = 2y + 2 > 2, ^ = 2 — 2y < 0, so h(y) is a monotonous increasing 
function and g(y) is a monotonous decreasing function. By (6) we have h(a) = 
+ 2a < n. And when y = —1 -\- + n, 

h{y) = ( — 1 + Vl + n)'^ + 2( — 1 + Vl + n) 

= 1 — 2\/l + n + 1 + n — 2 + 2-\/l + n 



= n. 
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So a < — 1 + Vl + n. 

By (7) we obtain 

7"(G) > g{a) 

> g{-\ + vTTn) 

= 2( — 1 + Vl + n) — ( — 1 + Vl + 

= 2(— 1 + -\/l + n) — (1 — 2-\/l + n + 1 + n) 
= 4(\/n + 1 — 1) — n. 



We now show that this bound is best possible by the following graphs G 
construct by J. Dunbar ect.[3]. Let s > 4 be an even integer, and let H be 
isomorphic to | disjoint copies of 7 ^ 2 , 5 • Let Hi and H 2 be two disjoint copies 
of H. Further, let Xi and Yi be the sets of vertices of Hi of degree 2 and s, 
respectively, for i = 1,2. Now let G be the graph obtained from Hi U H 2 by 
joining every vertex of Yi to every vertex of I 2 • Then G is a bipartite graph of 
order n = s{s + 2) with partite sets Xi U Y 2 and X 2 LIY 1 . Let / be the function 
on G defined as follows: let f{v) = — 1 if u G Xi U X 2 , and let f{v) = 1 if 
u G Yi U l 2 - Then it is easy to verify that / is a minus dominating function on 
G with 7 - (G) = /(Y(G)) = 2s - = 4(\47TT - 1) - n. □ 

Theorem 2. If G = {X, Y) is a bipartite graph of order n, then 



7 (G) > \n 




e 

1 + 7nax{Sx^ ^y) 



)i 



where Sx = rnin{d{v)\v G X}, Sy = min{d{v)\v G Y}, and the bound is sharp. 

Proof. Let /, Mx,My,Px,Py, Qx,QY,mx, my,qx, Qy,Px and Py be defined as 
in the proof of Theorem 1 . For any a; G Y, let G denotes the number of vertices 
of weight 0 in N(x). Then we have 



|fV(a;) nM| < 



d(x)—tx 
2 > 
d(x) — l — tx 
2 

d{x)-tx _ 
2 



if X € P 
, if x€Q, 
I if X € M 



So 



^ d{y)= Y. |A^(a;)nMy|+ Y \N{x)nMY\+ Y \N{x)nMY\ 
y^My xGPx x^Qx x^Mx 



< 



E 

xePx 



d{x) - tx 
2 



+ E 

xeQx 



d{x) - 1-tx 
2 



+ E( 

x^Mx 



d{x) - tx 
2 



1 ) 



= E( 

xex 



d{x) 



1 

^Qx - mx 



(8) 



Obviously 



mySy < E 

V^My 



(9) 
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xex ^ ^ yeQv 

Combining (8), (9) and (10) we obtain 

{qy + 2my)SY + {qx + 2mx) < e. 

Similarly we have 

{qx + 2mx)Sx + {qy + 2my) < e. 

If qx + 2rrix < qy + 2my, by (11) and (12) we have 

qx + 2ma; < - ■ — - r , 

1 + max{dx,OY) 
e 



qy + 2my < 



So 



7 (G) = n - {qx + 2nix + qy + 2my) 

>\n-{^ + , J 1 

l + max{Sx,SY) 

> r«-(7 + i " , j i- 

^6 l + max{Sx,SY) 

If Qy + 2my < qx + 2mx, by (11) and (12) we have 

qy 2niy - I Yc c \ 

1 + max{Sx,SY) 

qx + 2mx < 

Ox 



So 



(10) 



( 11 ) 



(12) 



7 (G) = n - {qx + 2mx + qy + 2my) 

Ox l + max{dx,dY) 

0 1 + max{0x, oy) 

In fact, this bound is sharp, it is easy to check that 7 “(it'i^fe) = 1 = [n — | + 

t 1 

l-\-max{5x ,^y) ' 



3 A Lower Bound on fc-Subdomination Number 
of a Graph 

The concept of fc-subdomination was introduced by Cockayne et al.[2]. In [2], 
Cockayne et al. established a sharp lower bound on jks for trees. Moreover, 
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they also gave a sharp lower bound on 7 ^^ for trees if A: < ^ and proposed a 
conjecture. 

Theorem 3 [ 2 ]. For any n- vertex tree T and integer k G {1,2, .. . ,n}, < 

2{k + 1 ) — n. 

Conjecture 2 [ 2 ]. For any n- vertex tree and any k with < k < n, ^ks < 
2k — n. 

In [13], the conjecture was proved and a upper bound for a connected graph was 
given. 

Theorem 4 [13]. For any connected graph of order n and any k with < k < 
n, then 

In this section, we give a lower bound for a graph G. 

Theorem 5. For any graph G of order n and size e, 

2e + (n — k)(A + 2) 

JTl ■ 

Proof. Let / be a /c-subdominating function on G with f{V) = 7^5 (G). Let P 
and M be the sets of vertices in G that are assigned the values 1 and —1, respec- 
tively. Then jPj -|- \M\ = n and jksiG) = jPj — \M\ = n — 2\M\. Furthermore, 
we let 

Pi = {v € P\ f[v] > 1} 

P2 = P- Pi 

Ml = {v G M\ f[v] > 1} 
ik /2 = ^ — ^1 

Clearly, jPij -|- jMij > k. Since each vertex v of P\ is adjacent to at most ^d{v) 
vertices of M, each vertex v of Mi is adjacent to at most — 1 vertices of M. 
We have 

6\M\ < E = E \MDN{v)\ 

i>eM v^v 

<1:^+ E <<(-) 

wePi «gMi DGP2UM2 

v£V VGP2UM2 

<e-\M\ + ^ ^ {d{v) + 2) 

•UGP2UM2 

<e-lMl + (lP2| + |M2l)^. 
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As I-P2I + \M^ 2 \ < n — fc, it follows that 

2e+(n-fc)(Z\ + 2) 
' 2(5+1) 



lks = n- 2\M\ 

2e + (n — k)(A + 2) 

> n ; . 

5+1 

This completes the proof of Theorem 5. □ 

For the graphs in which each vertex has odd degree, the lower bound on 7^^ 
in Theorem 5 can be improved slightly. 

Theorem 6. For every graph G in which each vertex vertex has odd degree, 

2e + (n — k)(A + 2) — k 

iTl ■ 



lks>n- 



Proof. Let /, P, M, Pi, P 2 , Mi and M 2 be defined as in the proof of Theorem 5. 

Since every vertex of G has odd degree, it is easy to see that each vertex v of 
Pi is adjacent to at most vertices of M, each vertex v of Mi is adjacent 

to at most — 1 vertices of M . Hence we have 



5|M| < ^ d{v) = |MnfV(r;)| 



vgm vev 






VGP2UM2 



= lJ2d(v)-lm\ + \Mi\)-\Mi\ + l 



v£P2UM2 



<«-hirii + |M.i)-iMi+ y: 



< e - )(l^’Il + |Mil) - |M| + m + 

Since |Pi| + |Mi| > fc, IP2I + IM2I < n — fc, we have 

(A+l)|M|<e-t + +±l^^ 



Hence 



\M\ < 



2e + {A + 2){n — k) — k 
2(5+1) ■ 



jks{G) = n - 2\M\ 

2e + (Z\ + 2)(n — k) — k 

>n — ; 

5+1 



Thus 
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This completes the proof of Theorem 6. □ 

By Theorem 5 and Theorem 6, we easily obtain the following lower bounds 
on for r-regular graphs. 

Theorem 7. Let G be a r-regular graph of order n, then 

/ ~ 'n, for r even 

~ \ - n, for r odd 

In the special cases where k = \V\ and k = Theorem 7 deduces to the 

following results. 

Corollary 7 [9]. For every r-regular graph G of order n, 



ls{G) > 



forr odd 
for r even 



and the bounds are sharp. 

Corollary 8 [9]. For every r-regular(r > 2) graph G of order n, 



^raaj (G) ^ 



and the bounds are sharp. 
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Abstract. It is well known that in a graph, k(G) < A(G) < S{G), where 
k(G), A(G) and (5(G) denote the vertex connectivity, edge connectivity and 
the minimum degree of G, respectively. We show that in chordal graphs, 
if A(G) 7 ^ 5(G), then A(G) > 2k{G) — 1. In contrast, in a general graph 
A(G) can be equal to k(G). 



1 Introduction 

1.1 Chordal Graphs: Some Definitions and Notation 

Let C be a cycle in a graph G. A chord of C is an edge of G joining two vertices 
of G which are not consecutive. A graph G is called a chordal (or triangulated) 
graph iff every cycle in G of length 4 or more has a chord. Chordal graphs arise 
in many applications (see [3,4,5]). Chordal graphs constitute one of the most 
important subclasses of perfect graphs [3]. 

Let G = {V,E) he a, connected undirected graph. Throughout this paper we 
will use V for the set of vertices of G and E for the set of edges. \V\ will be 
denoted by n. N (v) will denote the set of neighbours of v, that is N (v) = {u G 
V : {u, v) G E}. For A C V, we use N{A) to denote the set ~ The 

subgraph of G induced by the nodes in A will be denoted by G{A). 

A bijection f : V — >■ {1, 2, • • • , n} is called an ordering of the vertices of G. 
Then f(v) is referred to as the number associated with the vertex v, or simply 
the number of v with respect to the ordering /. Given an ordering / of a graph 
G, we define the following terms. 

Definition 1. Let A G V . The highest(A) is defined to be the vertex with the 
highest number in A. 



Definition 2. A path P = {w\,W 2 , • • • , Wk) in G is ealled an increasing path, iff 
f{wi) < f{w 2 ) < ■ ■ ■ < f{wk)- It is called a decreasing path iff f{wi) > f{w 2 ) > 

■ ■ ■ > f{wk)- A single node can be considered as either increasing or decreasing. 
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Definition 3. A vertex u G N{v) is called a higher neighbour of v iff f{u) > 
f{v). The set of higher neighbours of v will be denoted by Nh{v), ie 

Nh{v) = {u€ N{v) : f{u) > f{v)} 

Definition 4. An ordering f ofG is called a perfect elimination ordering (PEO) 
iff for each v gV, G({f } IJ is a complete subgraph (clique) of G. Then 

f{v) will be called the PEO number of v. 

Note that every graph may not have a PEO. The following theorem is well 
known (see [3]). 

Theorem 1. An undirected graph G is chordal if and only if there exists a PEO 
forG. 

Note that there can be more than one PEO for a given chordal graph. We 
just use PEO(v) to denote the PEO number of v, since our arguments are valid 
with respect to any PEO. Whenever we talk of a chordal graph, we will assume 
a PEO for it and the words highest, increasing path etc will be used with the 
respect to this PEO. 

1.2 Edge Connectivity and Vertex Connectivity 

The vertex connectivity k{G) of an undirected graph is defined to be the min- 
imum number of vertices whose removal results in a disconnected graph or a 
trivial graph (ie, single node). A subset S' C V is called a separator if G{V — S) 
has at least two components. Then, k{G) is the size of the minimum sized sepa- 
rator. Note that a complete graph has no separator and its (vertex) connectivity 
is by definition n — 1. 

The following result called Monger’s theorem is well known among graph 
theorists. An elegant proof due to Dirac is given in [2]. 

Theorem 2. Menger’s theorem : The minimum number of vertices separat- 
ing two nonadjacent nodes s and t is equal to the maximum number of vertex 
disjoint s — t paths. 

Thus, there are at least k{G) vertex disjoint paths between every pair of 
nonadjacent nodes of G. 

The edge connectivity A(G) of a graph G is defined to be the minimum 
number of edges whose removal results in a disconnected graph or a trivial graph. 
A minimum cut is a cut with its size = A(G). Note that the minimum degree 
S{G) of the graph is an upper bound for mincut size (ie edge connectivity). 

1.3 Our Result 

It is well known that the following inequality (due to Whitney [6] ) holds in a 
general undirected graph. 



k(G) < A(G) < (5(G) 
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where S{G) is the minimum degree of the graph. For a discussion of this inequality 
and for related work see the chapter on connectivity in [2] . In [1] , Chartrand and 
Harary show that for all integers a, &, c, such that 0 < a < b < c, there exists a 
graph G with k(G) = a, A(G) = b and S{G) = c. 

In this paper, we study this inequality when restricted to the class of chordal 
graphs. We show that there is a “gap” between A(G) and k{G) in chordal graphs 
provided A(G) ^ S{G). More specifically, if A(G) S{G), then A(G) > 2 k(G) — 1. 



2 Edge Connectivity vs Vertex Connectiviy 
in Chordal Graphs 

First we make some simple observations about mincuts in undirected graphs. 
Remember that mincut is a minimum collection of edges which when removed 
disconnects the graph. Let V be partitioned to nonempty disjoint sets A and B 
by a cut. We denote the set of edges in the cut by (R, B). 

Lemma 1. Let {A, B) be a mincut of a connected undirected graph G. Then 
G{A) and G{B) are connected. 

Proof : Suppose that G{A) is not connected. Let G{Ai) be a connected 
component of G{A), where Ai C A. Let A 2 = A — Ai. Clearly {Ai, B[J A 2 ) is a 
cut of G and \{Ai,B IJ ^ 2 )! < \{A, B)\, since {A\,B IJ A 2 ) C (R, B). But this is 
a contradiction since {A, B) is assumed to be the mincut. □ 

Lemma 2. Let {A, B) be a mincut ofG, and let A(G) = |(R, i?)| < 5{G). Then 
there exists a node x € A, such that x ^ N{B). Similarly 3y G B such that 
yiN{A). 

Proof : Suppose that every node in A is adjacent to some node in B. Let 
M be a node in A. Let F = (R, B), be the mininum cut. We have 

d{u) = |iV(w) f| R| + \N{u) f| R| < \N{u) Pi R| + |A| - 1 

But 

|F| = ^ |IV(a;)f|R| > |V(u)Pi?| + |A| - 1 

x^A 

since each term in the sum should be at least 1 by the assumption that every 
node in A is adjacent to at least one node in B. It follows that d{u) < |F|. 
But by assumption, d{u) > S{G) > |F"|. Thus we have a contradiction and we 
conclude that there is a node x G A such that x ^ N{B). By similar arguments, 
3y G B such that y ^ N{A). □ 



Corollary 1. Lf (A,B) is a mincut of G, and if \{G) yf i5(G) then |R| > k{G), 
\B\>k{G). 
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Proof : By lemma 2 , there exists a node x & A such that x ^ N{B). Let 
y G B. It is clear that N{B) separates x from y. Thus |-/V(i?)| > k{G). But 
N{B) C A — {x}. It follows that k(G) < \A\. Similarly k(G) < \B\. □ 

A chordless path from rt to u is defined to be a path from m to v in G with 
no two non consecutive nodes of the path being adjacent. The reader can easily 
verify that if there is a path between u and v then there is a chordless path also. 
For example, the shortest path between u and v has to be a chordless path. 

Lemma 3 . Let P = {w\,W2, - ■ ■ ,Wk) he a chordless path in a chordal graph 
G and let Wi = highest(P). Then {wi,W2, ■ ■ ■ ,Wi) is an increasing path while 
{wi, Wi+i, ■ ■ ■ ,Wk) is a decreasing path. 

Proof: Consider first the path {w\, W2, - ■ ■ , Wi). Since Wi = highest(P), this 
cannot be a decreasing path. If f < 2 , obviously this is an increasing path. 
Let f > 2 . If {wi,W2, ■ ■ ■ ,Wi) is not an increasing path, then there should be a 
node Wj G P, 1 < j < i, such that PEO{wj) < PEO{wj+\) and PEO{wj) < 
PEO{wj-i). Then {wj-i,Wj+i} C Nh{wj). By definition ofPEO, Nh{wj) forms 
a clique, and we have (wj-i,Wj+i) G E. So (ruj_i, Wj+i) forms a chord for P, 
contradicting the chordless property of P. We infer that {wi,W2, ■ ■ ■ ,Wi) is an 
increasing path. A similar argument shows that (wj, • • • ,Wk) should be a 
decreasing path. □ 

Corollary 2 . Let P = {w\,W2, • • • , Wk) he a chordless path and Wk = highest(P) 
Then P is an increasing path. 

Corollary 3 . Let P = {wi,W2, ■ ■ ■ ,Wk) he a chordless path and PEO{w\) < 
PEO{wk). Then PEO{w 2)> PEO{wi). 

Let G = (V,E) be a chordal graph and (A,B) be a mincut of G. Without 
loss of generality assume that B contains the node with PEO number n. Let 
X = highest(A). Let Bi = B f] N (x) and B2 = B — Bi. 

Lemma 4 . Lf |i?i| < k{G), and A(G) yf <^(G), then P EO{highest{B2)) < 
PEO{highest{A)). 

Proof : Let x = highest(A) and y = highest{B2). (Note that since \B\ > 
k{G) (by corollary 1 ), and |Bi|<k(G), B2 is nonempty and therefore highest{B2) 
exists). First observe that Nh{x) C Bi, since every neighbour of x with a higher 
PEO number has to be in B. We will prove the lemma by contradiction. In other 
words, we will show that if PEO{y) > PEO{x), then |iV/i(a;)| > k{G), thereby 
contradicting the assumption that |i?i| < k{G), since Nh{x) C B\. Note that y 
is non adjacent to x since y G B2. We will show that Nh{x) separates x from y. 
Consider any path P from x to y. Clearly there is a chordless path P' from x to 
y, containing only the nodes in P. Let w be the second node in P' , starting from 
X. By corollary 3 PEO{w) > PEO{x). Thus w G Nh{x). We have shown that in 
every path P from x to y there is a node which is in Nh{x). Therefore Nh{x) is a 
separator and |N^(a;)| > k{G). This in turn implies |i?i| > k(G), contradicting 
the assumption. □ 
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Theorem 3. For a chordal graph G, if \{G) ^ 5{G), then A(G) > 2k(G) — 1. 



Proof : Let {A,B) be a mincut of G. Remember that |(T, R)| = A(G) < 5{G). 
Without loss of generality, let B contain the node with PEO number n. Let 
X = highest{A). By lemma 2, there is a node a G A, and a node b G B such 
that a ^ N{B) and b ^ N{A). We observe first that a yf x. To see this, consider 
a chordless path from x to the node with PEO number n, which is in B by 
assumption. By corollary 2, this is an increasing path and since x = highest(A), 
the second node in this path after x has to be in B. It follows that x G N{B). 
Therefore x yf a. 

Now since a and b are nonadjacent, by Menger’s theorem (Theorem 2), there 
exist at least k{G) vertex disjoint paths between them. One of these paths may 
contain x. Consider k{G) — 1 vertex disjoint paths between a and 6, which do 
not contain x. Each of these paths should exit A, thus contributing at least one 
edge to the mincut (T, B). Collect one such edge from each path and let Fi be 
the set of k(G) — 1 edges collected this way. We emphasize that none of the edges 
in Fi is incident on x,a or b. (Since we have avoided the path containing the 
node x; and none of the edges incident on a or 5 are in {A, B), by the choice of a 
and b). Moreover no two edges in Ej will share a common end point since they 
are all coming from distinct vertex disjoint paths. 

Now let Bi = N{x) p| B, be the set of neighbours of x which are in B. Let 
F 2 = {(a^, m) : u G B{\, the set of edges starting from x and ending at a node in 
Bi. Clearly F 2 C {A,B) and E 2 PIF 1 = 4>. If \Bi \ > k{G), IF 2 I > k{G) also and 
we have A(G) = | {A, B)\ > |Ei | + IE 2 1 > 2k(G) — 1, as required by the theorem. If 
|i?i| < k{G), let B 2 = B — Bi. B 2 is nonempty since \B\ > k{G) by corollary 1. 
Let y = highest{B 2 ) ■ By lemma 4, PEO{y) < PEO{x). Noting that x and y are 
nonadjacent, there exist k(G) vertex disjoint paths between them, by Menger’s 
theorem (Theorem 2). Clearly at least I = k{G) — |i?i| of them should be paths 
which consist of vertices in A[J B 2 only. Let these paths be Pi, P 2 , ■ ■ ■ , Pi- For 
each path Pi, 1 < i < I, there exists a chordless path P/ from y to x which uses 
only the nodes of Pi. Since x and y are nonadjacent, each of these chordless paths 
should contain at least 3 nodes. Let Ui be the second node in P', starting from y. 
We claim that Ui G A. This is because PEO{ui) > PEO{y) by corollary 3 and 
y = highest{B 2 ). Since Ui ^ Pi, it has to be in A. Thus (ui,y) G (A,B). Now 
consider a chordless path in B from y to the node having PEO number n. Since 
G(P) is connected by lemma 1, such a chordless path exists and clearly it is an 
increasing path. Let w be the second node in this path starting from y. Clearly 
Ui and w are neighbours of y with higher PEO number than that of y itself, 
ie {w,Ui} C Nh{y). By definition of PEO, Nh{y) forms a clique and therefore 
{ui, w) G E. Clearly since w G B and Ui G A, {m, w) G (A, B). Since no two edges 
in F\ shares a common end point, one of the two edges {ui, y) or (ui, w) is not in 
Pi. Now for each P', I < i < I, add to P3, one of {ui, w), {ui, y) which ever is not 
in Pi. Clearly jPaj = I = k(G) — |Pi|. Also P3 H P3 P| ^2 = (f- Since 

C (A,P),wehaveA(G) = |(A,P)| > \P^\ + \F 2 \ + \F 3 \ = 2n{G)-l, 
as required by the theorem. This completes the proof. □ 
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Abstract. Graham and Harary [3]studied how the diameter of hyper- 
cubes can be affected by increasing and decreasing edges. Since many 
networks are constructed by graph products, e.g., tori and meshes, in 
this paper we study how the diameter of graph products, particularly 
Cartesian product and Kronecker product, can be changed by adding 
and removal of edges. We study Cartesian products on paths, cycles, 
trees, and hypercubes. The diameter of the Kronecker product of two 
graphs is in general difficult to find. We in particular study the Kro- 
necker product of two cycles. 



1 Introduction 

Let G = (V,E) denote a graph. For any two vertices u,v in G, let ( 1 g{u,v) 
denote the distance between u and v which is defined as the length of a short- 
est path between them. The diameter of G, denoted by D{G), is defined as 
D{G) = maxu,vev doiu, v). Graham and Harary [ 3 ] studied how the diameter of 
hypercubes can be affected by increasing and decreasing edges. We are interested 
in the following measures defined in [ 3 ]: 

D~{G): the least number of edges whose addition to G decreases the diameter; 
D'^{G): the least number of edges whose deletion from G increases the diameter. 

For example, since deleting an edge from a cycle of length to, denoted by Gm, 
increases the diameter from [to/ 2 J to to — 1 , it follows that D^{Gm) = 1 - The 
superscript of D indicates the “change” of diameter, i.e., to mean decreasing 
the diameter, and “-I-” increasing the diameter. 

Many popular networks are constructed by graph products. We are partic- 
ularly interested in Cartesian product and Kronecker product. Let Gi and G2 
be two graphs, and we write Gi = (Vi,Ei) and G2 = {V2,E2) throughout the 
paper. The Cartesian product G = (V, E) of Gi and G2, denoted by G = Gi • G2, 
is given by U = Vi x V2, and 

E = {{uiU2,ViV2) \ui=vi and (u2,V2) G E2, or U2 = V2 and (ui,Vi) G Ei}. 

The Kronecker product G = (V,E), also known as direct product and tensor 
product, of Gi and G2, denoted by G = Gi x G2, is given by U = Vi x V2 and 

E = {{uiU2,viV2) I (wi,ui) G El and (u2,V2) G E2}. 
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A torus is the Cartesian product of two cycles, and mesh the Cartesian product of 
two paths. The diagonal toroidal mesh discussed in [ 6 ] is the Kronecker product 
of two cycles. In [2], the authors studied the Cartesian products of complete 
binary trees, shuffle-exchange graphs and de Bruijn graphs, which possess various 
good properties. 

We adopt the following notation. Let and C„ denote a path and a cycle, 
respectively, of n vertices labeled by 0,1, ...,n — 1. Let be a hypercube of 
dimension n of 2 " vertices with each vertex denoted by an n-bit binary repre- 
sentation, where two vertices are connected by an edge if and only if two vertices 
differ at exactly one bit. 

Since we work on graph products, for ease of exposition we use (), [] and () 
to delimit a vertex, a pair of vertices and an edge, respectively. For example, 
each vertex in G = Gi • G 2 is denoted by (^ 1 ,^ 2 ), where vi G Vi and V 2 € ¥ 2 - An 
edge in Gi • G 2 is denoted by ((mi, M 2 ), (ui, V 2 )). We also use ( and ) to delimit 
a path. 

2 Preliminaries 

Let S{G) denote the minimum degree of G, i.e., S{G) = min{SG{v) \ v in G}, 
where 6g{v) denotes the degree of a vertex v in G. It is obvious that D'^{G) < 
S{G). 

It is known that D{P„) = n—1, D {Pn) = 1 and D^{Pn) = 1. The diameter 
of cycle G„ is given by D{Gn) = \n/‘2-\ - Obviously, D+(G„) = 1. It is shown 
in [4] that H“(G„) = 2 for n > 8 by adding the following two edges: 

ei = (0, \ n/2\), 

_ f ([n/4j, |"3n/4]) for n = 2 mod 4, , , 

1 (L^/ 4 j 5 L3^/4J) for n = 0 , 1 , 3 mod 4. ^ ' 

Hypercubes have been extensively studied. Let m be a vertex in Q„. It is 
known that is n-connected. Let x and y be two vertices in Qn- We use 
h[x,y] to denote the hamming distance between x and y which is defined as 
the number of differing bits in x and y. It is shown in [3] that D~{Qn) = 2 
by adding two diagonal edges in any subcube Q 2 of Q„, e.g., (uq, 00 • • • OH) 
and (00 . . . 001, 00 . . . 010). It is well known and shown in [5] that there are n 
internally node-disjoint paths between x and y, h[x,y\ of them having length of 
h[x, y] and n — h[x, y] having length of h[x, y\ + 2. It follows that = n—1 

by deleting n —1 edges in a subcube Qn-i of Qn which are incident at any specific 
vertex [3]. For example, deleting (000,001) and (000,010) from Q 3 , the distance 
from 000 to Oil becomes 4, so is the diameter. 

3 Cartesian Products 

Cartesian products also preserve the node symmetry property of its two primitive 
graphs. To be specific, let Gi, G 2 be node-symmetric. The following lemma can 
be easily proved. 
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Lemma 1 Let Gi,G 2 be node- symmetric. Then G'i-G '2 is also node-symmetric. 

The following lemma can be easily shown. 

Lemma 2 D{Gi ■ G 2 ) = D{Gi) + D{G 2 ). 

For example, D{Tm,n) = D{Gm) + D{Cn) = L^/2J + \ n/2\. Furthermore, it can 
be easily verified that 

D+{Gi ■ G 2 ) > min {L>+(Gi), D+{G 2 )} + 1 (2) 

D~{Gi ■ G 2 ) > min{D-{Gi),D-{G2)}. (3) 

One may ask whether (2) and (3) always hold as inequalities. To answer this, 
consider Pm ■ Pn which is called mesh of size m by n. It can be easily verified 

that D~{Pm ■ Pn) = 1 and D^{Pm ■ Pn) = 2, i.e., (2) and (3) hold as equalities. 

We subsequently ask how to characterize those Gi and G 2 for (2) and (3) to 
hold, respectively, as equalities. Another interesting question is whether we can 
tighten the lower bound in (2). 

On the other hand, we wonder whether (2) and (3) can hold as equalities 
when Gi and G 2 are node-symmetric. Consider Tm,n = Cm • G„, where Ci is 
node-symmetric. We can show the following theorem with proof omitted. 

{ 2 for TO, n odd, 

3 for one of to, n odd, 

4 for TO, n even. 

The above theorem illustrates that (2) does not necessarily hold as equality 
when Gi and G 2 are node-symmetric. It is shown in [4] that D~{Cn) = 2 for 
n > 8 and D~{Tm,n) = 2 for to > 12 or n > 12, and (3) holds as equality. 

In what follows, we study T>+(Gi • G 2 ) and D~{Gi ■ G 2 ) for Gi,G 2 to be 
paths, cycles, hypercubes and trees. 



(11) D+{Cm ■ Pn) = 



(vi) D+{Qm ■ G„) = 



Lemma 3 (i) D~{Cm ■ Pn) = 1 for all n > 4. 

2 for TO odd, 

3 for TO even. 

(Hi) D~{Qm • P„) = 1 for n > 3. 

(iv) D+{Qm ■ Pn) = m. 

(v) D~{Qm ■ Cn) = 2 for n > 8. 

TO -I- 2 for n even, 

m-\- 1 for n odd. 

(vii) D~{T\ -T 2 ) = mm{D~ (Ti) , D~ {T 2 )} and D^{Ti - T 2 ) = 2, where T\ and T 2 
are arbitrary trees. 



After careful observation of the above theorem, we have the following con- 
jecture. 

Conjecture 1 If Gi and G 2 are node-symmetric, then 

D-{Gi-G2)=mm{D~{Gi),D-{G2)}. 

Furthermore, we conjecture that the lower bound in (2) can be increased as 
stated below. 



Conjecture 2 D+(Gi • G 2 ) > max{Z?+(Gi), Z?+(G 2 )} -I- 1. 
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4 Kronecker Product of Cycles 



Kronecker product preserves good properties of its two primitive graphs. For 
example, if Gi and G 2 are node-symmetric, Gi x G 2 is also node-symmetric. 

The diameter of a Kronecker product on two arbitrary graphs is in general 
difficult to find. In this section, we consider Kronecker products of two cycles Cm 
and G„. In this section, we calculate D{Cm x G„) using different approach from 
that in [6] and find D+(Gm x G„) and give an upper bound for D~{Cm x G„). 
If both m and n are even. Cm x G„ becomes disconnected. In this section, we 
assume that at least one of m and n is odd. 

In Gfc, let N{i) denote the zth-neighborhood of vertex 0 which is defined as 
the set of the vertices in G^ that can be reached from 0 at the zth step. Then 

— — i + — for z odd, 

* {0, 2, 4, . . . , z, /c — z, fc — z -I- 2, . . . , fc — 2} for z even. 

For example, given k = 11, we have 

fV(l) = {l,10}, iV(2) = {0,2,9}, 1V(5) = {1,3,5,6,8,10}, 

N{6) = (0, 2, 4, 5, 6, 7, 9}, iV(9) = (1, 2, 3, 4, ... , 10}. 



With careful observation, we can obtain the following remark. 

Remark 1 (i) i = [fc/2j is the smallest integer satisfying 

u;=i^(j) = {0,l,2v, k-l}, 

and in particular, iV([fc/2j — 1) U N{ [A:/2J) = |0,1,2,...,A: — 1}. 

(ii) \k/2\ ^ N{i) for all i < \k/2\ — 1. 

(Hi) N{i) C N{i + 2) for 1 < z < rz — 3, and in particular, N{i) = N{i + 2) for k 
even and k/2 < i < k — 3. 



In this section, let G = Cm x G„ and d* = D{C). Since G is node-symmetric, 
we can consider d* as the longest distance from (0, 0) to all of the other vertices. 
In order to distinguish the zth-neighborhood of vertex 0 in Cm and G„, we use 
Ni{i) and N2{i) to denote that in Cm and G„, respectively. Henceforth, we use 
N(i) to denote the zth-neighborhood of vertex (0, 0) in G without ambiguity, i.e., 
the set of vertices in G that can be reached from (0, 0) at the zth step. It follows 
that N{i) = Ni{i) x N2{i). The diameter d* is the smallest integer z satisfying 
Uj=i ^U) = ViX ¥ 2 - It can be easily verified that d* > maa;{[m/2j, [rz/2j}. 

In this paper we consider that both m and n are odd and assume without 
loss of generality that n > m. The case with exactly one of m and n being odd 
can be similarly treated. We have the following results. 

Lemma 4 Let both m and n be odd, and n> m. Then 




for m = n, 

for m + 2 < n < 2m + 1, 
for n > 2m + 3. 



D{Cm X G„) 
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Proof: We assume without loss of generality that m < n. To find d* , we distin- 
guish the following cases: 

Case (i): m = n. 

Since Ni{m— 1) = Vi and Ni(i') = N 2 (i), it follows that 7V(m — 1) = Vi x V 2 
and d* < m — 1. On the other hand, since Ni{m — 1) — Ni{m — 2) = {0, m — 1} 
and Ni{m — 2) — Ni{m — 3) = {!}, it follows that the vertices (0, 1), (1,0), 
(0,m — 1), and (m — 1, 0) are not in N(m — 2). Therefore, m — 2 < d* < m — 1, 
i.e., d* = m — 1. 

Case (ii): m -I- 2 < n < 2m + 1. 

Since m > [n/2j, it follows that N 2 {m — 1) U N 2 {m) = V 2 . Furthermore, 
since Ni{m) = Vi, it follows that N{m) = Vi x V 2 and thus d* < m. On the 
other hand, since Ni{m— 1) — Ni{m — 2) = {0}, N{m— 1) does not contain the 
vertex set {(0,j) | j G N 2 {m) — N 2 {m — 1)}. It follows that m — 1 < d* < m, 
i.e., d* = m. 

Case (iii): n > 2m + 3, i.e., [n/2j > m -I- 1. 

Note that A^i([n/2J — 1) = A^i([n/2J) = Vi and d* > L^/2J. On the other 
hand, since A^2(L^/2J — 1) U A^2(L^/2J) = V 2 as stated in Remark l(i), it follows 
that Ui=i^^ ^ ^ 2 ( 1 ) = V. Thus d* = [n/2j. The lemma follows. □ 



Our proof method for Lemma 4 is different from that in [6]. We then can 
show the following theorem with proof omitted. 



Theorem 2 Let m,n he odd, and n>m. 



D+{C^ X C„) 



1 

2 



if n = 2m + 1, 
otherwise. 



We can also give a construction of adding edges such that the diameter is reduced 
which also provides an upper bound of D~{Cm x C„). 
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Abstract. All vertex-sets of cliques of G form the simplexes of an ab- 
stract complex C{G), which is called the clique complex oi G. It is proved 
that the clique complex of a plane graph G is acyclic if and only if G is 
contractible. 



1 Introduction 

Graphs considered are finite and simple, complexes considered are finite. Unde- 
fined signs can be found in [1,2]. An n-complex is a complex of dimension n. 
n-simplex, n-chain and n-cycle are defined analogously. The topology properties 
of graphs have important applications in image analyses [3]. 

Given an n-complex K and integer 0 < q < n, Hq{K) and x(A') denote the 
qth homology group of K with coefficients integers J and the Euler characteristic 
of K, respectively, d is the boundary operator on K. An n-complex K is called 
acyclic if Hq{K) = 0 for q > 0. Two complexes K and L are homologous, denoted 
by AT ~ A, if Hg{K) = Hg(L) for each <; > 0. 

Given a graph G = (V (G), E{G)). Nc{u) denotes the set of vertices adjacent 
to u in G. doiu) and 6{G) denote the degree of u and the minimum degree 
of G, respectively. A A- vertex is vertex of degree k. For a set S' C V{G), G[S] 
denotes the subgraph of G induced by S. A clique is a complete subgraph of G. 
All vertex-sets of cliques of G form the simplexes of an abstract complex G{G), 
which is called the clique complex of G. For convenient, we use Hg(G) to denote 
the qth homology group of G{G), and use a sequence of vertices of G to denote 
an oriented simplex of G{G). 

Defination 11 (Ivashchenko) A family 5ft of graphs is called contractible if 
Ail G 5ft and each graph in 5ft can be obtained from Ki by following contractible 
transformations: (1) Deleting a vertex v from G, where G[A’g'(u)] G 5ft; (2) 
Adding a vertex v (G) to G and joining it to each vertex of an induced sub- 
graph Gi € IR of G; (3) Deleting an edge uv G E{G), where G[Na{u) p| A^g(u)] G 
5ft; (j) Adding an edge uv ^ E{G) to G, where G[Nc{u) fj Ng{v)] G 5ft. All graphs 
in 5ft are called contractible. 

Theorem 11 W (Ivashchenko) Contractible transformations do not change the 
homology groups of graphs. Especially, if a graph G is contractible, then Hq{G{G)) 
= J, and Hg{G{G)) = 0,q > 1. 

* Research partially supported by the Innovation Fund, of CAS. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 395-399, 2001. 

© Springer- Verlag Berlin Heidelberg 2001 




396 Baogang Xu 



By Theorem 11, C{G) is acyclic while G is contractible. We guess that each 
connected graph with acyclic clique complex is contractible. Our main result is 

Theorem 12 Let G he a connected plane graph. Then C(G) is acyclic iffG is 
contractible. 

2 Homology Groups of Plane Graphs 

Let G be a plane graph. When we say that iL is a subgraph of G, it means that 
the plane embedding of is a restriction of G on H. For a face / of G, we use 
6(/) to denote a sequence of vertices in clockwise order on the boundary of /. 
A face / of G is called a fc-face if / is incident with exactly k edges. A maximal 
subgraph of G is a maximal plane graph which is a subgraph of G. Let G be a 
cycle of G, we use Intc{G) to denote the set of vertices located in the interior 
of G. 

Let be a maximal subgraph of a plane graph, {/i, / 2 , ...,/;} the set of 
faces of H. Suppose that fi is the unbounded face of H. We define link{H) = 
Y^i =2 Kfi) ~ Kh)- Then link{H) is a cycle of G(G). 

We use T(G) to denote the set of maximal subgraphs of G which has at least 
four vertices and contains no separating triangles(a triangle uvwu is a separating 
triangle if both the interior and the exterior of uvwu contain vertices of G) , and 
let LT(G) = {link{H)\H G T(G)}. 

One can easily check the following 

Lemma 21 If G is a plane graph, then Hq{G) = 0 for q > 3. 



Lemma 22 If G is a plane graph, then any three distinct 3-simplexes of G{G) 
can not share a common 2-simplex. 



Let X = S J) = SSi be two g-cycles of a 

complex K. If |si| < \ri\{i = 1,2,..., aq), riSi > 0 while Si yf 0, and ri^ ^ Siq 
for some io G {1,2,..., aq}, then 2 ; is called a proper subcycle of x. A basic cycle 
is a cycle containing no proper subcycle. 

Lemma 23 Let G he a plane graph, c a 2-cycle of G(G). Then, c is a basic 
cycle if and only if c= link{H) for some maximal plane subgraphs H ^ K^. 

Proof. The sufficiency is clearly. Following we prove the necessity by contradic- 
tion. If it is not so, let c be a basic 2-cycle and c yf link{H) for any maximal 
plane subgraph H ^ K^. Let Hq denote the subgraph of G induced by the edges 
corresponding to the 1-simplexes in c. If Hq is not a maximal plane graph, then 
it must has a non-triangular face, say /o, let uqVo be an edge incident with /q. 

Without loss of generality, assume that /o is the unbounded face of Hq . Be- 
cause c is a cycle, there must be two distinct triangles xqUoVoXo and yoUoVoyo in 
Ho such that Xq G IntHqiyoUoVo)- Let Gq = Ho[{yo,uo,Vo] U IntHoiyoUoVoVo)]- 
Then |G(Go)| < |G(G)|. 
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If there still exist l-faces in Go with I > 4, then take such a face, say fi, and 
an edge uiVi incident with fi. There must be two distinct triangles xiuivixi 
and yiUiViXi in Go such that xi € IntGo(yiUiVi) because c is a cycle. Let 
Gi = Go[{pi, ui, vi} U IntGo(yiUiViyi)]. Then, |t^(Gi)| < |V^(Go)|. 

Go ahead so on and so forth. Because V (Ho) is finite, there must be a triangle 
yuvy in Ho such that IntHg (yuvy) yf 0 and all faces of Ho in the interior of yuvy 
are 3-faces. Let H' = Ho[{y ,u,v}yMntHo{yuvy)\, and let c' = link{H'). Clearly, 
c' is a proper subcycle of c. A contradiction. I 

Let Gi and G 2 be two plane graphs, fi with boundary U1U2U3U1 and /2 with 
boundary V1V2V3V1 be two 3-faces of Gi and G 2 , respectively. We can embed G 2 
into the 3-face fi by identifying Ui with Vi (i = 1,2,3), respectively. This new 
plane graph is called a triangle- embedding 0 /G 2 in G\. 

Lemma 24 Let G he a plane graph, c a basic 2-cycle o/G(G). Then, 

c ~ i~{c)c, where r{c) = 0, or 1. (1) 

ceLT{G) 

Proof. By Lemma 23, there exists a maximal subgraph H yf K 3 such that 
c = link{H). Without loss of generality, suppose H ^ K 4 and H ^ T(G), u, w, v 
are the three vertices incident with the unbounded face of H in clockwise order. 
The proof is by induction on the order of H. It is obviously that \V{H)\ > 5. 
While \V{H)\ = 5, H is isomorphic to the graph obtained by embedded K 4 in 
K 4 . Obviously, (1) holds. 

Assume that (1) is valid while \V{H)\ < n {n > 5). Following suppose 
\V{H)\=n+l. 

Case 1. There exists x G Intniuvwu) such that H[{u,v,w,x}] = K 4 . Since 
H ^ K 4 , then at least one of the three cycles xuvx, xuwx and xvwx is nonempty. 
Without loss of generality, suppose xuvx, xuwx, xvwx are all nonempty. 

Let Cuv = link{H[{x,u,v}\J Intnixuvx)], = link{H[{x,v,w'\\J Intnix 
vwx)] and Cuw = Hnk{H[{x,u,w}[j Intnixuwx)]. Suppose abc is a simplex of 
c, q,quv,q_vw,<luw denote the coefficients of abc in c, Cuv, Cvw, Cuw, respectively. 
Then, by the definition of link{H), 

q = quv = 1, qvw = quw = 0, while {a, b, c} C Intnixuvx), (2) 

q = qvw = 1, quv = quw = 0, while {a, b, c} C Intuixvwx), (3) 

q = quw = 1, qvw = qw = 0, while {a, b, c} C Intnixuwx), (4) 

Since 2-simplex uwv appears only in c, xvu appears only in xwv only in 
Cvw, and xuw appears only in Cuw- Combining (2), (3) and (4), we have 

c — Cuv — Cvw — Cuw = —uwv -\- xwv -\- XUW -\- XVU = dxuvw, 

i.e., c ~ Cuv + Cvw + Cuw- (1) holds by the induction hypothesis. 

Case 2. There exists separating triangle abca G H. Let Ho = H[{a, b, c} IJ 
IntH{abca)], Hi = H[V{H) — Intniabca)]. Then, c = link{Ho) -\- link{Hi), 
hence (1) holds by the induction hypothesis. I 
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Lemma 25 Let G be a plane graph. Then chains in LT{G) are not boundary 
chains and any two chains in LT{G) are not homologous. 

Proof: By contradiction. Let H G T{G), c = link{H) = dz where 2 : is a 3-chain 
of G{G). Without loss of generality, suppose xoUqVoWo is a simplex of z such 
that xo G IntG(uoVoWoUo) and 

jIntG(uoVoWoUo)j= min{jInt(uvwu)jj3xGlnto(uvwu), xuvw is a simplex of z}. 

By the definition of T{G), H contains no separating triangle, so xoUqVo 
is not a simplex of c, and hence there exists yo G Intc(xoUoVoXo) — V{H) 
such that XQUoVoyo is a simplex of z. It is clearly that \IntG{xoUoVoXo)\ < 
\IntG{uoVoWoUo)\, contradicts to the choice of uqVoWqUo. I 

Lemma 26 Let G be a connected plane graph. Then, Hq{G) = J , Hi{G) = 
[{m -I- 1) — x(C'(G))] J, i? 2 (G) = mJ, where m = |T(G)|. 

Proof: Because G is connected, then G(G) is connected, by Lemma 24, Lemma 
25, Hq{G) = J, and H 2 {G) = mJ. Because each basic 1-cycle of G(G) corre- 
sponds to a unique cycle of G, then G (G) has no 1 dimensional torsion coefficient, 
andso, iLi(G) = [(m-hl)-x(G(G))]J. I 

A pseudo-maximal plane graph is a 2-connected plane graph which contains 
at most one non-triangular face. 

Lemma 27 Let G be a 2-connected plane graph such that Hi{G) = 0, and / a 
non-triangular face ofG. Then, there exists a pseudo-maximal subgraph H of G 
such that f is the unique non-triangular face of H . 

n 

Proof: Let G = V 1 V 2 . ■ . VnVi (n > 4) be the boundary of /, c = ^ ViVi+i (mod 

i=l 

n). 

Let {sf\i = 1,2,..., « 2 } be the set of 2-simplexes of G(G). Since iLi(G) = 0, 

OC2 

then there exists a 2-chain of G(G), say x = ^ Uisf, such that c= dx. We chose 

i—1 

such a 2-chain with the property that 

^2 0'2 Ot2 

^ loil = min{^ |6i||c= (5) 

Let H be the subgraph induced by those edges corresponding to the 1- 
simplexes contained in x. Let H* = H -\- G* = G -\- {uvi}2=i, where 

u^V{G). 

Clearly, G* is a plane graph, H* is a subgraph of G*, and y = uVi+iVi 

(mod n) is a 2-cycle of G(G*). According to (5), y has no proper subcycles, i.e., 
y is a basic cycle. By lemma 23, H* is a maximal plane graph, therefore / is the 
unique non-triangular face oi H . I 

Proof of Theorem 12: We only need to prove the necessity. By definition 11, 
we can assume that G is 2-connected. By Lemma 26 and Lemma 27, |P(G)| = 0 
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and G can be obtained by a serial of triangle-embeddings of its pseudo-maximal 
plane subgraphs. Since contractible transformations are inversable, it suffices to 
show that each pseudo-maximal plane subgraph of G with acyclic clique complex 
is contractible. 

Let B be a pseudo-maximal plane subgraph of G with acyclic clique complex. 
Without loss of generality, suppose that i? ^ ATa. It is easy to verify that B is 
contractible while \E{B)\ < 8. Now we proceed to the induction process. 

Suppose each pseudo-maximal plane graph with less than n edges and with 
acyclic clique complex is contractible, where n > 9. Let B be a pseudo-max;imal 
plane subgraph of G with acyclic clique complex, \E{B) \ = n, G the boundary 
of the non-triangular face of B, and uv an edge of C . Then we have 

|]Vb(m) n ]Vb(u)| = 1. (6) 

If it is not so, let X\,X 2 G Nb{u) r\NB{v) and IntB{xiuvx\) C\V{B) = 0. 

Case 1. x\X 2 G E{B). Clearly, G[{x\,X 2 ,u,v}] = K 4 . If Inta{xiX 2 VXi) U 
IntH{xiX 2 

uxi) = 0, then d,B{xi) = dcixi) = 3. By the definition of contractible trans- 
formations, x\ can be deleted from B, and by the induction hypothesis, B is 
contractible. So we assume Into{xiX 2 VXi) U Into{xiX 2 UXi) yf 0. Go on and 
go forth, one can always find a vertex xq G Inta{uvx 2 u) such da{x) = 3 and 
{x} U Nb{x) induced a K 4 , then xq can be deleted from B, and so B is con- 
tractible by the induction hypothesis. 

Case 2. X 1 X 2 ^ E{B). There must be a vertex in IntB{uxiVX 2 u) which is 
adjacent to a;i because B is a pseudo-maximal plane subgraph. Therefore, either 
there exists a 3- vertex a in i? such that i?[{a}UNB(a)] = K 4 which indicates that 
deletion of a from B is a contractible transformation, or B contains a subgraph 
which is in T(G) which contradicts to that G{G) is acyclic. 

Now, (6) is proved. Deletion of uv from B is a contractible transformation. 
Let Bi = B — uv. If S{Bi) = 1, then one of u or v, say u, is a 2-vertex in B, so 
deletion of u from B is a contractible transformation, and hence by the induction 
hypothesis, B is contractible. If 5{Bi) > 2, then B\ is a pseudo-maximal plane 
subgraph of B and \E{Bi)\ = n — 1, again by the induction hypothesis, Bi is 
contractible. I 
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Abstract. This work deals with the domination numbers of generalized 
de Bruijn digraphs and generalized Kautz digraphs. Dominating sets for 
digraphs are not familiar compared with dominating set for undirected 
graphs. Whereas dominating sets for digraphs have more applications 
than undirected graphs. We construct dominating sets of generalized 
de Bruijn digraphs under some conditions. We investigate consecutive 
minimum dominating set of the generalized de Brnijn digraphs. For gen- 
eralized Kautz digraphs, there is a consecutive minimum dominating set 
that is a minimum dominating set. 



1 Introduction 

It is one of major areas in theoretical and algorithmic observation to study dom- 
ination and its related topics for undirected graphs [1,11]- Besides domination 
has been defined in digraphs, concepts of solution and kernel which are pecu- 
liar to digraphs have been defined. The origin of solution and kernel is in game 
theory [10]. Solution and kernel have applications for logic and facility loca- 
tion [4,7,9,16]. Considering applications of domination and its related topics for 
game theory and logic, digraphs are more natural and have more applications 
than undirected graphs. Although digraphs have many applications, there are 
few studies for domination in digraphs. Barkauskas and Host, and Bar- Yehuda 
and Vishkin [2,3] have studied algorithms for domination of digraphs. Barkauskas 
and Host showed that the problem of determining whether or not an arbitrary 
digraph has an efficient dominating set is NP-complete and gave a linear time al- 
gorithm for finding efficient dominating sets in directed trees [2] . There are some 
researches for kernel in the digraph with no odd circuits [6] and for domination 
in tournament that is a class of digraphs [8,14]. 

Each edge has orientation in digraphs, these edges are called arcs. This sit- 
uation cause difficulty of research for domination in digraphs. 
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2 Definitions and Terminologies 

V{G) and A{G) are the vertex set and the arc set of a digraph G{V,A), respec- 
tively. There is an arc from a: to y if {x, y) G A{G). The vertex x is called a pre- 
decessor of y and y is called a successor of x. The sets 0{u) = {w|(u, v) € Gl(G)} 
and I(u) = {riKu,?/) G A{G)} are called the outset and the inset of the vertex 
u, respectively. Similarly, we define 0[u] = 0(u) U {m}, I[u] = I{u) U {u}. If 
S C V{G) then 0{S) = UsesO(s), I{S) = Usgsl(s), 0[S'] = UsgsO[s], and 
I[S] = Usgs/[s]. A set S' C V{G) is a dominating set of a digraph G if u is a 
successor of some vertex m G S for all v ^ S. Aset S CV (G) is an absorbant of a 
digraph G if there is a vertex u G S such that u is a successor of v for all v ^ S. 
A set S C V{G) is independent if for any x,y G S,{x,y) ^ A{G). If S C V{G) 
is independent and a dominating set of a digraph G, S is called a solution of 
G. If S C T (G) is independent and an absorbant of a digraph G, S is called a 
kernel of G [10]. |A| denotes the cardinality of a set X. For a vertex u G V{G), 
|0(u)| is called the out degree of u. We use the notation 7(G) for the minimum 
cardinality of a set S C T (G) which is a dominating set. If S' is a dominating set 
of G whose cardinality is equal to 7(G), S is called the minimum dominating set 
of G. Let m, n be positive integers, (m, n) denotes the greatest common divisor 
of m and n. m\n and m /fn mean that m divides n and m does not divide n, 
respectively. 

In this paper, we will describe domination numbers of generalized de Bruijn 
digraphs and generalized Kautz digraphs. Furthermore we construct minimum 
dominating sets of generalized de Bruijn digraphs and generalized Kautz di- 
graphs. Definitions of generalized de Bruijn digraphs and generalized Kautz 
digraphs are as follows. Generalized de Bruijn digraph Gs(n, d) is defined by 
congruence equations. 

r K(Gs(n,d)) = {0,l,2,...,n-l}, 

\ AiGsin, d)) = {{x, y)\y = dx i (mod n), 0 < i < d}. 

If n = d^, GB{n,d) is the de Bruijn digraph B{d,D). 

Whereas definition of generalized Kautz digraphs GK{n,d) is 

fK(GK(n,d)) = {0,1, 2 ,..., n- 1}, 

{ A^Gxin, d)) = |(x, y)\y = —dx — i (mod n), 0 < i < d}. 

If n = d(d — 1)'°“^, G/c(n, d) is the Kautz digraph K{d — l,D). Generalized de 
Bruijn digraphs and generalized Kautz digraphs are introduced by Imase and 
Itoh [12,13], and Reddy, Pradhan and J. Kuhl [15], so called Imase Itoh digraphs. 
It is well known that de Bruijn digraphs and Kautz digraphs have good prop- 
erty for network topology compared with hypercubes [5]. These generalizations 
remove the restriction for the cardinality of vertex sets of these digraphs and 
make these digraphs more valuable for network models. 




402 



Yosuke Kikuchi and Yukio Shibata 



3 Domination Numbers of Generalized de Bruijn Digraphs 



We prepare a simple lemma for determining domination number of generalized 
de Bruijn digraphs. 

Lemma 1. 7 (Gs(n,(i)) > 

Proof. Let S' be a minimum dominating set of Gsin, d).We obtain |S| +d|S| > n 
from the definition of GB{n, d). 

Above lemma gives the lower bound for the domination number of Gsin, d). We 
also obtain the following lemma concerning the upper bound of the domination 
number of Gs(n, d). 

Lemma 2. 'y{GB{n,d)) < 

Proof. For GB{n,d), let S = {1,2,..., [(n + G)/d \ }. Then 0(S) = {d,d+ l,d + 
2, . . . , [(n+ l)/dj d + d— 1} with duplications. |0(S)| = [(n + l)/dj d + d—1 = 
([(n + l)/d] — l)d+d— l>n. Thus S is a dominating set of GB{n, d). □ 

From the proof of Lemma 2, S = (1,2,..., [(n + l)/dj } is a dominating set, 
but not minimum. For example, the out set of a subset (1,2, 3,4} of Gb( 12,2) 
is {2,3,4, 5, 6, 7, 8,9}. Then {1,2, 3, 4} is not a dominating set of Gs(10,4). 
Whereas {1, 2, 3, 4, 5, 6} is a dominating set of Gs(10,4), still 7 (Gb( 12,2)) = 4 
because the vertex subset {4, 5, 6, 7} is a minimum dominating set of Gs(12, 2). 
Thus {1, 2, 3, 4, 5, 6} is not a minimum dominating set. Next theorem shows a 
sufficient condition for the domination number of Gs(n, d) to be |"n/(d+l)] 
and a method of determining the dominating set of Gs(n,(i). 

Theorem 1. If there is a vertex x G V{GB{n,d)) satisfying, 

(d— l)a;— ^ d^^ —1^=0 (mod n) (1) 

for some I (0 < I < (d + 1) |"n/(d + 1)] — n), then 





j{GB{n,d)) 




Proof. From Lemma 1, we show the correctness of this statement by constructing 
a dominating set of Gs(n, d) that contains x satisfying equation (1). Let D = 
{x,x + 1, . . . ,x + \n/{d + 1)1 — 1} C y (GB{n, d)). Then out sets of vertices in 
D are as follows. 

0(x) = {dx, dx + 1, . . . , dx + d — 1}, 

0(x + 1) = {d(a; + 1), d{x + 1) + 1, . . . , d(a; + 1) + d — 1}, 



0{x + \n/{d+ 1)] — 1) = {d{x + \n/{d+ 1)] — 1), d(a; + |"n/(d + 1)] — 1) + 1, 
. . . , d{x + [n/(d+ 1)] — 1) + d — 1}. 
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Thus \0{D) \ = d 



d+l 



, and \D\ = 



d+l 



. Since x satisfies equation (1), 



dx = X + \n/{d + 1)] — ? (mod n). 

Therefore \Dr\0{x) \ = I, and by the condition for I, we obtain \n/{d + 1)] —I > 0. 



\D\ + \0{D)\-\DnO{x)\ 

= \n/{d+ 1)~\ + d \n/{d+ 1)] — I 

> \n/{d+ 1)~\ + d \n/{d+ 1)] — {(d+ 1) \n/{d+ 1)] — n} 
= n. 



Hence _D is a minimum dominating set of G(n, d) . □ 

We obtain the following statement for a special case Theorem 1. 

Corollary 1. For GB{n,d), j{GB{n,d)) = 1 if and only if d > n or n is odd 
and d= n — 



Proof. We only show the necessary condition because sufficiency is obvious. Since 
7 (Gs(n,d)) = 1, there is a vertex x G V{GB{n,d)) such that |0[a;]| > n. If 
|0(a;)| > n then d > n because of the definition of GB{n,d). If |0(a;)| = n — 1 
then d = n—1 and x € V {GB{n, d)) has no loop. Hence, we obtain x+1 = {n—l)x 
(mod n). Therefore 2a; + 1 = 0 (mod n), then n is odd. □ 

From Theorem 1, we see the following corollary for the solution of GB(ji,d). 

Corollary 2. Let d+ l\n. Then if there is a vertex x € V(GB(n, d)) such that 

Ti 

(d—l)x ; = 0 (mod n), 

a + 1 

then there is a solution S of G B{n,d) such that 



We prepare the reversal of a digraph G, denoted by G“^. The reversal of a 
digraph G is defined by V{G~^) = V{G), (x,y) G A{G~^) if and only if 
{y,x) G A{G). For generalized de Bruijn digraphs, GB{n,d) is not necessar- 
ily isomorphic to GB{n,d)~^ . For de Bruijn digraph B{d,D), we can see that 
B{d,D) = B{d, D)~^ . Then there is one to one correspondence between the 
dominating set and the absorbant in B{d, D). For B{d, D), the minimum cardi- 
nality of an absorbant is equal to the domination number. Therefore, we obtain 
the following statement. 

Corollary 3. For B{d,D), both the minimum cardinalities of the absorbant and 
the dominating set are ■ 
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Proof. We first assume that D is even. Then the value I in Theorem 1 takes an 
integer between 0 to d and from the definition of B{d, D), d > 2. We put 

a: = + . . . + + ... + d^ + l (mod d^), 

1=1, 

then X satisfies equation (1) in Theorem 1. Next let us assume that D is odd, 
then the value I in Theorem 1 takes an integer between 0 to L We put 

? = 1, 

then X satisfies equation (1) in Theorem 1. □ 

Minimum dominating set is not necessarily a set of vertices whose labels are 
consecutive. For Gs(10, 5), both {1, 2} and {3, 6} are minimum dominating sets 
of Gs(10,5). The set {1,2} is constituted of consecutive vertices and the set 
{3, 6} is not constituted of consecutive vertices. Thus Theorem 1 gives a sufficient 
condition but not a necessary condition. It is not easy to find all minimum 
dominating sets of Gs(n, d). We can not determine that domination number of 
a given G_b(u, d) is equal to \n/{d+ 1)] only using Theorem 1 because Gsin, d) 
may have a minimum dominating set which is constituted of non-consecutive 
vertices and whose cardinality is equal to |"n/(d+ 1)]. Then we give a necessary 
and sufficient condition for Gs(n,d) to have a minimum dominating set S with 
l'5'l = In-Zid + 1)] and constituted of consecutive vertices. We call such S a 
consecutive minimum dominating set. 

Theorem 2. GB(n,d) satisfies one of the following four conditions if and only 
if there is a vertex x G V {Gsin, d)) such that {x, x + 1, . . . ,x+ \n/ {d + 1)] — 1} 
is a consecutive minimum dominating set. 

Conditions 

(i) d + 1 /n, 

(ii) d+ l|n and d is even, 

(Hi) 2{d + l)|n and d is odd and 4|d + 1, 

(iv) 2^{d + l)|n and d is odd and d + 1 contains only one 2 as a factor. 

(d — 1 = 2^s, s is odd) 

Proof. It is obvious that 7 (Gs(n, d)) = 1, when d>n. Suppose that d < n. We 
show that there is a vertex x in V{GB{n,d)) such that 

X + \n/{d + 1)~\ = dx + i (mod n), (2) 

0 < t < minjd — 1, \nj (d + 1)] (d + 1) — n|. (3) 

From equation (2), we obtain 

(d — l)x = |"n/(d + 1)] — f (mod n). (4) 

For equation (3), the value i takes an integer between 0 and d — 1 if d + 1 is not a 
divisor of n. If d + l|n, then i = 0. GB{n,d) has a consecutive dominating set S 
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with [S'! = \n/{d+ 1)] if and only if there is a vertex x G V{GB{n,d)) satisfying 
equation (4). If c?+ 1 is not a divisor of n, let {d — l,n) = s. Then there are q 
and r, (0 < r < s) such that \n/{d + 1)] = qs + r. We can put r = i, from the 
ranges of r and i. Thus there are x and i satisfying equations (2) and (3) in 
Next, if d + 1 is a divisor of n, 

{d — \)x = n/{d+ \) (mod n) (5) 

by equation (4) . The equation (5) has a solution if and only if 

((d- l),n)|n/(d+ 1). (6) 

Hence if d is even, there is an integer x satisfying equation (5) in Z„. 

Whereas if d is odd, d — l,d + 1 and n are even integers. We put d + 1 = 

{h{< 2) is an integer, a is an odd integer), then d — 1 = 2(2^“^o; — 1). We obtain 
2|d — 1 and 4 /fd — 1, since 2^~^a — 1 is odd. Therefore n is divided by 2^+^. 
Then n is divided by 2(d + 1). 

For d + 1 = 2^ a, we consider h=l, then put d — 1 = 2(o; — 1) = 2^/3, (/3 is odd). 
2?!.+i (divides n by (6). Therefore n is divided by 2^(d+l). Conversely if d) 

has a consecutive minimum dominating set {a;, a;+l, . . . , x+ \n/ (d+1)] —1}, then 
X satisfies equation (4). Therefore n and d satisfy one of above four conditions. 

□ 

We consider whether there exists GB{n,d) that does not have a minimum 
consecutive dominating set and ^{GB{n,d)) = \n/{d+ 1)]. We show that if 
|"n/(d + 1)] = 1,2, or 3, there does not exist GB{n,d) that does not have a 
minimum dominating set and ^{GB{n, d)) = \n/{d + 1)] . 

Theorem 3. For domination number of generalized de Bruijn digraphs, we ob- 
tain 



7(Gb(2s,2s-1)) = 2, (7) 

7(Gb(8s-4,4s-3)) = 3, (8) 

and 

7(Gb(6s,2s-1)) = 4, (9) 

where s is a natural number. 

Proof. For equation(7), each vertex of Gs(2s, 2s— 1) has a loop. Then 7 (Gb(2s, 
2s - 1)) 7^ 1 

For equation(8), let 7(Gb(8s — 4, 4s — 3)) = 2 and a vertex x is contained in a 
dominating set of Gb (8s — 4, 4s — 3) . Then x + 1 or a; + 4s — 2 is contained in the 
dominating set of Gb{8s — 4,4s — 3). If the dominating set is {x,x + 1}, then 



(4s — 3)x = X + 2 (mod 8s — 4), 
4(s — l)x = 2 (mod 8s — 4). 
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Therefore we have a contradiction. 

If the dominating set is {x, x + As — 2}, then we obtain 

(4s — 3)x = X + 1 (mod 8s — 4) (10) 

or 

(4s — 3)(x + 4s — 2) = X + 1 (mod 8s — 4). (11) 

If equation (10) holds, then (4s — 4)x = 1 (mod 8s — 4). This is a contradiction. 
If equation (11) holds, then (4s — 4)x = 4(5s — 4s^) — 5 (mod 8s — 4). This is 
also a contradiction. 

For equation(9), let us assume that 7(Gf,(6s, 2s — 1)) = 3. We should consider 
three cases ; (1) any two vertices are not consecutive in the dominating set, (2) 
exactly two vertices are consecutive in the dominating set, and (3) vertices in 
the dominating set are consecutive. Case (1); let a vertex x be in the dominating 
set, then other vertices in the dominating set are x + 2s and x + 4s. Thus x 
satisfies 

(2s — l)x = X + 2ts + 1 (mod 6s) 
for some t {t = 0, 1, 2). From this equation, we obtain 

2(s — l)x = 2ts + 1 (mod 6s), 

and this is a contradiction. Case (2); let x and x + 1 be consecutive vertices in 
the dominating set, then another vertex in the dominating set is x + 2s + 1 or 
X + 4s. If the dominating set is {x, x + 1, x + 2s + 1}, then 

f (2s — l)(x + 2s + 1) = X + 2 (mod 6s), 

( (2s — l)(x + 1) + 2s — 2 = X — 1 (mod 6s). 

By these simultaneous equations, we obtain 

4s(s — 1) = 1 (mod 6s). 

Then this is a contradiction. 

If the dominating set is {x, x + 1, x + 4s} then 

(2s — l)(x + 4s) + 2s — 2 = x— 1 (mod 6s). 

By this equation, we obtain 

2(s — l)x = 8s — 8s^ + 1 (mod 6s) 

and this is a contradiction. 

Case (3); let x, x + 1 and x + 2 constitute a dominating set. Then 

(2s — l)x = X + 3 (mod 6s). 

Hence we obtain 

2(s — l)x = 3 (mod 6s), 



and this is also a contradiction. 



□ 
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4 Domination Numbers of Generalized Kautz Digraphs 



We can see the following lemma similar to Lemma 1. 



Lemma 3. 7 (G/f(n, d)) > 



n 

d + 1 



□ 



Besides the difficulty of determining domination numbers of generalized de Bruijn 
digraphs, it is easy to determine the domination number of a generalized Kautz 
digraph GK(ji,d). 



Theorem 4. 



j{GK{n,d)) 



n 

d+1 



Proof. D 
GK{n,d) 



{ 0 , 1 , 2 ,.. 



n 

d+1 



1} C V{GK{n,d)) is a dominating set of 

□ 



5 Conclusion 

We investigated domination numbers of generalized de Bruijn digraphs and gen- 
eralized Kautz digraphs. Consecutive dominating set is important in our inves- 
tigations. Finding a dominating set of a generalized de Bruijn digraph is more 
difficult than that of generalized Kautz digraph. We conjecture that there is a 
consecutive minimum dominating set for Gs(n, d). 
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A Notion of Cross-Perfect Bipartite Graphs 
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Abstract. In this note, we consider four quantities defined on a bipartite 
graph B: the cross-chromatic index the biclique number w*{B), 

the cross-free matching number m*{B) and the biclique edge covering 
number l3*{B). We mention several applications of these numbers and 
define a notion of cross-perfect bipartite graphs. A duality between these 
numbers for the class of cross-perfect graphs is examined. 

Keywords: bipartite graph; perfect graph; integral polytope. 



1 Definitions 

Given a bipartite graph B = {UUV, E), two non-adjacent edges e,e £ E with 
e = (ui,Vi) and e = {u 2 ,V 2 ) are said to form a cross if (Mi,r' 2 ) G E and 
(t 62 ,ui) G E. Two edges are said to be cross-adjacent if either they are adjacent 
(i.e. share a common node) or they form a cross. A cross-free matching in B 
is a set of edges E C E with the property that no two edges e, e £ E are 
cross-adjacent. A cross-free coloring of i? is a coloring of the edge set E subject 
to the restriction that no pair of cross-adjacent edges has the same color. A set 
of edges E C E is said to form a hiclique if the induced subgraph corresponding 
to E is a complete bipartite subgraph of B. A hiclique edge cover for B is & 
covering of the edge set E by bicliques. 

We now consider the following four quantities on B: (i) The cross- chromatic 
index, of B which is the minimum number of colors required to get a 

cross-free coloring of B (ii) The hiclique number, w*{B), of B defined as the 
cardinality of the maximum edge biclique in B (iii) The cross-free matching 
number of B, m*{B), defined as the edge cardinality of the maximum cross-free 
matching in B and (iv) The hiclique edge covering number of B, j3*{B), defined 
as the minimum number of bicliques required to cover the edges of B. 

This note is concerned with a duality between the four numbers defined 
above. The duality, as stated here, resembles the classical duality between the 
four quantities, namely, stability number a{G), clique partition number 9{G), 
chromatic number 7 (G) and the maximum clique number w(G) for a perfect 
graph G [1][2][9]. In fact, we show that the duality between w*{B), x*{B), m*{B) 
and (3*{B) can be studied by investigating when a certain class of graphs is 
perfect. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 409-413, 2001. 
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2 Applications and Complexity Resnlts 

The problem of finding a maximum cross-free matching in a bipartite graph is 
discussed in [16]. This problem was studied as a special case of a more general 
problem of finding alternate cycle free matchings in bipartite graphs. Alternate 
cycle free matchings were first investigated in [10] in the context of dependence 
graphs of matroids: If At is a matroid with basis B, finding a maximum cardi- 
nality alternating cycle free matching is related to finding a basis of At for which 
the intersection with B is of minimum cardinality. Another application studied 
in [3] and mentioned in [16] concerns the computing of the setup number or step 
number v{G) of an acyclic directed graph G. 

The problem of finding a maximum edge cardinality biclique has applications 
in manufacturing in the computer industry. This application is discussed in detail 
in [5]. In short, the relationship between a set of products and a set of components 
which constitute the products can be represented by a bipartite graph. One way 
to reduce manufacturing lead times is to reduce the final assembly times for 
the products by creating sub-assemblies. Finding good sub-assemblies in this 
context can be done by finding large edge-cardinality bicliques in the bipartite 
graph. Another interest ing application, also described in [5], occurs in the area 
of formal concept analysis [7]. 

Covering the edges of a bipartite graph by bicliques has an interesting ap- 
plication [4] is combinatorial chemistry: Split Synthesis [6] [11], is a method of 
choice to build libraries for combinatorial chemistry. The basis for this technol- 
ogy is that molecules can be grown with one end tethered to a resin bead, while 
the other end is extended one residue at a time in parallel across a set of beads. 
Split synthesis can be modeled by a directed acyclic graph in which each node 
represents a grow step on a particular subset of beads. In [4], it is shown that 
optimizing split synthesis construction is intimately related to covering the edges 
of a bipartite graph by bicliques. 

Finding j3*{B) is NP-hard [8]. In [16], it is shown that finding m*{B) is NP- 
hard. Recently, R. Peeters [15] showed that finding w*{B) is also NP-hard. To 
the best of our knowledge, the complexity of finding x*(R) is open. 

3 Formulations and Results 

Clearly, a biclique cannot have more than one edge from a cross-free matching. 
This immediately gives the following integer programming formulation for w*{B) 

w*{B) = max{la; : K*x < 1, a; G {0, 1}"} 

where n = \E\ and K* is a (0, 1) incidence matrix whose rows correspond to 
maximal cross-free matchings of B and whose columns correspond to the edges 
of 5. It is also easy to see that 

X*{B) = min{ly : yK* > l,y G {0,1}™} 
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The fractional biclique polytope of B is P = {x € RJf : K*x < 1}. We denote the 
convex hull of feasible integer solutions of P hy Pj. We generalize the definitions 
of w* and x* to consider binary objective functions. For c G {0, 1}" 

w*{Bc) = max{ca; : K*x < 1, a; G {0, 1}"} 

X*{Bc) = min{ly : yK* >c,y€ {0, 1}™} 

Note that the definitions of w*{B) and w*{Bc) differ only in the objective func- 
tion. The constraint matrix K* is the same for both. Also, w*{Bc) does not 
correspond to the optimal edge cardinality biclique in the subgraph (either edge- 
induced or node-induced) corresponding to the vector c. This is an important 
distinction between the definition of w*{Bc) and the corresponding definition for 
the maximum clique in a graph: For a graph G, the formulation for the maximum 
clique, a{G), generalized to binary objective functions, can be written as 

a{Gc) = maxjca: : Kx < l,a: G {0, 1}^} 

where p = |t^(G)| and K is the clique matrix of G [14]. Here, a{Gc) corresponds 
to the maximum clique on the node-induced subgraph of G corresponding to c. 
A similar observation can be made about x*{Bc)- 

Definition: B is cross-perfect if w*{Bc) = x*{Bc) for all vectors c G {0, 1}". 
Theorem 31 B is cross-perfect iff P is integral. 

Proof. Consider the line graph, L{B), of B. For every 4-hole in L{B), add the two 
cords. Let the new graph be called L (B). We refer to L (B) as the modified line 
graph of B. Then, E C if is a cross- free matching m B \EV{E ) is a independent 
set in L (B). The clique polytope for L is Q = {cc : 5'a: < 1, x G i?+} where the 
rows of S correspond to maximal independent sets and the columns correspond to 
the nodes of L . But Q = P. Thus, B is cross-perfect implies that w{H) = j{H) 
for every node induced subgraph El oi L where w{H) denotes the cardinality of 
the maximum clique in H and y{H) is the chromatic number of H . Then, L is 
perfect by the weak perfect graph theorem [12] [13]. It follows that Q and hence 
P is integral. Conversely, if P is integral, so is Q and hence w{H) = y{H) for 
every node induced subgraph H oi L . Thus, w*{Bc) = x*{Bc) for all vectors 
c G {0,1}” and B is cross-perfect. 

Note 1: An alternative proof of Theorem 3.1 was suggested by Prof. R. Chan- 
drasekaran: The column-intersection graph of the matrix K* is the complement 
graph of the modified line graph L (G) defined above. The definition of the cross- 
perfectness and the perfect graph theorem [12] then imply that P is integral. 
Note 2: Even though the proof of Theorem 3.1 uses the correspondence between 
the biclique number w* and the clique number w, the structure of these two 
problems is fundamentally different. In fact, even the complexity question of 
computing w* was open until the recent result of Peeters [15]. Theorem 3.1 and 
Theorem 3.4 (given below) are thus remarkable since they mention a duality. 
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similar to the classical duality in the perfect graph literature, for fundamentally 
different problems. 

In general, the biclique polytope, P, is not integral. This follows because if P 
were integral, so would the clique polytope, Q, of the graph L (B) which in turn 
would mean that L (B) is perfect. However, it is easy to show a L (B) which 
contains an odd hole of length 5 as a node-induced subgraph (see figure below) . 




Fig. 1. A bipartite graph whose biclique polytope is not integral 



Corollary 32 B is cross-perfect iff the modified line graph L (B) is perfect. 

This observation suggests that the property of cross-perfectness in bipartite 
graphs can be studied by studying perfectness in modified line graphs. If B is 
cross-perfect then, by Theorem 3.1, the constraints K*x < 1 are sufficient to 
describe Pj. Note that every clique in L does not correspond to a biclique in 
B. However, every maximal clique in L does correspond to a biclique. Hence, 
a maximum clique, which is also a maximal clique, corresponds to a maximum 
biclique in B. This observation immediately gives us the following result. 

Theorem 33 If B is cross-perfect then w*{B) and X*{B) are polynomially solv- 
able. 

Every independent set in L corresponds to a cross- free matching in B. Con- 
sider a covering, C, of the nodes of L by cliques. A clique c* in C either cor- 
responds to a biclique in B or if not, c* is contained in a max;imal clique which 
corresponds to a biclique in B. Thus, a{L ) = m*{B) and 9{L ) = f)*{B). Let 
H he a node induced subgraph of L and cG{0,l}”be the incidence vector of 
the nodes corresponding to H. Then, it is easy to see that a{H) = m*{Bc) and 
9{H) = (3*{Bc). This correspondence along with the weak perfect graph theorem 
[12] [13] gives us the following result. 

Theorem 34 The following are equivalent 
1. w*{Bc) = x*{Bc) for all vectors c £ {0, 1}”. 
m*{Bc) = (3* (Be) for all vectors c G {0, 1}”. 



Theorem 35 Node-induced subgraphs of cross-perfect graphs are cross-perfect 
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Proof. The fractional biclique polytope, Ph, corresponding to a node induced 
subgraph H of B is a, face of P. Hence if P is integral, so is Ph- 

Corollary 36 If B is cross-perfect then w*{H) = x*{H) for all node induced 
subgraphs H of B. 

Getting an efficient recognition algorithm for cross-perfect bipartite graphs and 
the validity of the strong perfect graph conjecture [2] [9] for modified line graphs 
(defined in Theorem 3.1) are outstanding open questions. Also, it is not known 
whether the class of modified line graphs of cross-perfect bipartite graphs is a 
new class of perfect graphs. 
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Abstract. Consider a graph G = {V,E) with an [a, 6]— factorization 
T = {Fi,F2, ..., Fm}. It is proved in this paper that: 

1. there is an m— matching of G to which T is orthogonal if n = 
|F(G)| > (2 + ^)(m — 1) for b > 2a and n > 3.26m for h = a\ 

2. if < a < b, then for any given edge e of G, there is a [1, a]— sub- 
graph H of G such that e is included in H and T is orthogonal 
to H. 



1 Introduction 

All graphs considered are undirected graphs without loops and without mul- 
tiple edges. Let G be a graph, we denote by V{G) and E{G) the vertex set 
and edge set of G respectively. Denote by da{v) the degree of v in G. Let a 
and b be two integers. An [a, 6]— factor of G is a spanning subgraph F of G 
such that a < dp{v) < b for every v € V(F). In particular, if a = b = k, 
then an [a, 6]— factor is a fc— factor. For simplicity and in case when there is no 
ambiguity, we denote a subgraph and its edge set with the same symbol. An 
[a, 6]— factorization F = {Fi, F 2 , ..., F„} of G is a partition of F(G) into edge 
disjoint [a, 5]— factors Fi, F 2 , ..., F^. Similarly, we define A— factorization as a 
partition of F(G) into fc— factors. Let E be a subgraph of G with m edges and 
T = {Fi, F 2 , ..., Fm} an [a, 6]— factorization of G, then T is called orthogonal 
to H if each Fi,l < i < m has exactly one edge in common with FI. Other 
definitions and terminologies can be found in [2]. 

Alspach B. posed the following conjecture: 

Conjecture 1. [1] Let G be a 2d— regular graph, F = {Fi, F 2 , ..., F^} be a 
2— factorization of G, then there exists a d— matching M of G to which T is 
orthogonal. 

G.Z.Liu et al. [4] proved that the conjecture is true when |K(G)| > 4d — 5 
and extend the question to fc— factorizations and [1, A]— subgraphs. Later, M. 
Kouider et al. [3] improved the result of [4] to |K(G)| > 3.23d. 

In this paper, we improve the above results and extend them to [a, 6]— factori- 
zation. The main techniques of this paper come from those in [4] and [3]. 

* Research is partially supported by a grant from the Research Grants Council of 
Hong Kong SAR (CityU 1074/00E) and a grant from CityU of Hong Kong (Project 
No. 7001215). 
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Let A and B be two disjoint subsets of V(G), then for any 1 < i < m, denote 
by Ei{A,B) the set of edges of factor Ft with one end in A and the other in 
B, by Ei{A) the set of edges of factor Fi with both ends in A. Let fci , kj be 
j different indexes between 1 and m, then denote by Eki,...^kj{A, B) the set of 
edges of all the factors F^^, ■■■,Fk^ with one end in A and the other in B. 

2 Main Theorems and Proofs 

Theorem 1. Let a, b and m he positive integers with a < b. Let G he a graph 
with an [a, b]— factorization T = {Fi, F2, Fm}- Then there is an m— matching 
of G to which T is orthogonal if n = |T(G)| > (2 + y)(m — 1) for b > 2a and 
n > 3.26m for b = a. 

Proof. We apply induction on m. When m = 1, the theorem is obviously true. 
Suppose the theorem is true for m — 1. We will prove that the theorem holds for 
m > 2. By induction hypothesis, there is an (m— 1)— matching M' of G to which 
T' = {Fi, F2, ..., Fm-i} is orthogonal. Assume that there is no m— matching of 
G as described in theorem 1. We will see a contradiction. 

W.l.g., suppose M' = {ei, 62, .., Cm-i}, where Ci = Xiiji G F^, 1 < z < m — 1 . 
Let A = {xi,yi : 1 < i < m — 1}, B = V{G) \ A. Then \A\ = 2{m — 1 ), 
\B\ = n — 2(m — 1 ). Since there is no m— matching of G satisfying the conclusion 
of theorem 1 according to our assumption, we have that Em{B) = <j). Then 
Em = Em{A,B) + Era{A). 

We now reindex the edges in M' and let s denote the maximum index subject 
to the following two conditions, 

1. there exist two distinct vertices x\ and y'^ in B satisfying that x\x'-^ and yiy[ 
are edges in F^; 

2. for each index z between 2 and s, there exist two distinct vertices x[ and y' 
satisfying that Xix'^ and z/zy' are edges in Fz_i. All the {x',y'}(z > 1) are 
disjoint. 

Consider the subset S = {ei,...,eg} of M'. We claim that S is nonempty. 
For there are always some edges, say ej = xjyj in M' which are adjacent to 
two independent edges in Em{A,B). Otherwise, for each edge Cj = Xjpj G M', 
either the edges in Fm{A,B) are only incident with one end point of Cj or both 
the edges in Fm{A, B) incident with Xj and yj end in the same vertex in B. 
Therefore, we always have a|F| < h\M'\ = 6^, and subsequently we have 

rz = |A| + \B\ < (1 + A)|yi| = (2 + _ 1), (1) 

a contradiction to our assumption. 

For each index k between 1 and s, denote by Ak the set of the ends of 
ei, 62, ..., 6fc, by Aj. the complement of Ak in A; denote by Bk the set of x', y', 1 < 
i ^ k, hy B'f. the complement of Bk in B. Then we have: 

|Afc| = \Bk\ = 2k, \A'k\ = 2{m-l-k),\B'k\=n-2m + 2- 2k. (2) 
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Since according to our assumption there is no matching of G to which T is 
orthogonal, we have 

Es{As, B's) = Es{Bs, B'g) = Es{B's, B'g) = 0 . ( 3 ) 

Besides, due to the maximality of s, similar as the discussion before, any edge x 
with X G EM'{A'g) is adjacent to at most one independent edge in Es{A'g, B'g). 
Therefore we have: 

\B's\ < ^\K\, (4) 

which implies that: 

n = \B' \ + 2m — 2 + 2s < — \A'\ + 2m — 2 + 2s = (2H — ){m— 1) + (2 )s. (5) 

2a a a 

Similarly, we have: 

Ek,k+i,...,s{Ak, B'l^) = 0 , 

Ek,k+i,...,s{Bk, B';.) = 0 , 

Ek,k+i,...AB'k,B',j) C {x'A\k + l <i<s}, (6) 

Bk^k+i,...,s{Ak U Bk-, Ak U Bk)^ 



c 


{x^y[,y,x'i, 


x'iy'zl 


l<i< 


A:} U {cfc}. 




Subsequently we have: 












\Ek,k+i,... 


,s{Ak, A'A 


> a 


|Afc|(s — 


A: + 1) — \Ak\ - 


-2, 


\Ek,k+i,... 


.,s{Bk, A'fS)\ 


> a 


\Bk\(s- 


A:+l)-2|Bfc| 




\Ek,k+i,... 


AB'kAA 


> a 


\B'kA- 


A: + 1) — 2(s — 


’k). 


And therefore. 












\Ek,k+i,.. 


,AA,AkAB)\ 


> o(s - 


- k + l)(|Afc| + 


\B\) 








-(6A; + 


2)-2{s-k). 





On the other hand, counting the edges in Ek^k+i,...s{A'^t Ak U B), we have that: 
\Ek,k+i,...,s{A'j., Ak U -B)| < 6|A^|(s — A: + 1) — 2(s — fc). (9) 

Subsequently we have, 

— A: + 1) — 2(s — k) > a{s — k + l)(|^fc| + |i?|) 

-(6A: + 2) -2(s-fc). ^ 



Therefore, 

a{s — k + l){2k + n — 2m + 2) — 6(s — k + l)(2m — 2 — 2k) < 6A: + 2. 
Hence, we have: 

^ 2m{a + b) 2{a + b) 2k{a + b) ^ 6k + 2 
~ a a a a{s — A: + 1) 

Now we have two cases to discuss. 



( 11 ) 



(12) 
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Case 1. b > 2a. 

From (5) we have that 



n < (2 + -)(m — 1), (13) 

a 

a contradiction to our assumption. 

Case 2. b = a. 

From (5), we have n < 3m — 3 + s. Now, if s < 3, then n < 3m < 3.26m, 
a contradiction; if s = 4, then m > s + 1 = 5 and n < 3m + 1 < 3.26m, still a 
contradiction. Therefore, in the following discussion, we assume that s > 5. 

Set fc = s — 4 in equation (12), then we have : 

(14) 



(15) 

From (5), we have s > n — 3m + 3 and subsequently we have 



n < 



2m{a + b) 2{a + b) 2k{a + b) 6(s — 4) + 2 



a a a 

which is equivalent to the following equation: 



5a 



r- 10^ 6, 22 30& 

5n < (10 H )m — (10 H )s + 30 1 . 

a a a a a 



5n < (10 + 



106, 

)m 

a 



(10 + +: 

a 



6 22 

-)(n — 3m + 3) + 30 

a a 



306 

a 



Therefore, 



n < 



40a + 406 - 18 
15a + 106 — 6 



4 

< 

a 



80a - 18 
25a- 6 



m < 3.26m, 



a contradiction to our assumption that n > 3.26m. 
Thus completes the proof of Theorem 1 . 



(16) 



(17) 

□ 



Corollary 1. Let G be a kd— graph with vertex number n > 3.26d and let T = 
{Fi,F 2 , be a k— factorization ofG. Then G has a d— matching to which 

T is orthogonal. 

Proof. We can prove this corollary by setting a = b = k in Theorem 1. □ 

Corollary 1 greatly improved G.Z.Liu et al. ’s results concerning orthogonal 
A:— factorizations in [4]. 

Theorem 2. Let a, 6 and I be integers such that < a < b. Let G be a graph, 
T = {Fi, F 2 , ..., Fi} be an [a, b]— factorization ofG. Then for every given edge e 
of G, G has a [1, a]— subgraph H such that e € H and to H F is orthogonal. 
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Proof. We use induction on 1. When / = 1, the theorem is trivial. Suppose the 
theorem is true for ^ < m — 1, we will prove it is true for I = m. 

W.l.g., suppose e ^ Fm- By induction assumption, G has a [1, a]— subgraph 
p[', such that e £ H' and that to H' , T' = {Fi, F 2 , ■■■, Fm-i} is orthogonal. 
Suppose G has no satisfying [1, a] —subgraphs, we will see a contradiction to our 
given conditions of theorem 2. 



Let 

S = {xi, Hi : XiUi G E{H')}, T = V(G)\S. 


(18) 


Let 

Si = {x £ S and dH>(x) < a}, 
Sa = {x £ S and d,H'{x) = a}. 


(19) 


Since G has no satisfying [1, a]— subgraphs, we have that = 

On the other hand, Em{T,Si) = 0, Em{T,T) = 0. Therefore, 


EUSa,V{G)). 


E^{T,V{G)) = E.^{T,Sa), 


(20) 


and subsequently, 




\E^{T,V{G))\ < \E^\ = \E^{Sa,V{G))\. 


(21) 


Notice that 




\E^{T,V{G))\ > a\T\, \E^{Sa,V{G))\ < b\S,\, 


(22) 


we have 

a|T| < \E^{T,V{G))\ < \E^{Sa,V{G))\ < 6|^a|, 


(23) 


and subsequently we have 

7 




|r|<-|^a|. 


(24) 


Now we have two cases to discuss: 




Case 3. a = 2. 

In this case, Ff' is a [1, 2]— subgraph of G. Hence, 




2(to-1) = |^z|+ 2(1^1-15,1). 


(25) 


Then we have 




|5,| = 2(|5| - to) + 2, \Sa\ = |5| - |5,| = 2to - |5| - 


2. (26) 


Notice that 

h h 




\T\ = n-\S\<^\Sa\ = ^{2m-\S\-2), 


(27) 


we have 

26 26 26 
n< — TO (“)|5| < — TO. 


(28) 



On the other hand, since n > ma + 1, we have 

1 26 

ma + 1 < — TO, 
a 

which implies a < a contradiction to a > v^. 



(29) 
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Case 4- a > 3. 

Since H' has m — 1 edges, we have a|S'a| < 2(m — 1) and therefore |S'a| < 

0( fj~j 1 ^ }) 

. Recall that \T\ < -|5'a|, \T\ = n — [S'!, we have 



n<\S\ + -\Sa\<2m-2+ = (2 + ^)m - 2 - (30) 

rt^ 



a 












r- 2b 

Since a > V2b, we have ^ < 1 and therefore n < 3m — 2 < ma + 1, a 
contradiction to n > ma + 1 . 

Theorem 2 is proved. □ 



Remark 1. We can derive the results concerning [1, k] — subgraph in [4] by setting 
a = b = k in Theorem 2. 
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Abstract. In a systematic erasure code for the correction of two si- 
multaneous erasures, each information symbol must have two associated 
parity symbols. When implemented in a redundant array of independent 
disks (RAID), performance requirements on the update penalty neces- 
sitate that each information symbol be associated with no more parity 
symbols than the two required. This leads to a simple graph model of 
the erasure codes, with parity symbols as vertices and information sym- 
bols as edges. Ordering the edges so that no more than / check disks 
(vertices) appear among any set of d consecutive edges is found to opti- 
mize access performance of the disk array when d is maximized. These 
cluttered orderings are examined for the complete graph Kn- The max- 
imum number d of edges is determined precisely when / < 5 and when 
/ = n — 1, and bounds are derived in the remaining cases. 



1 Background 

Let G = (V,E) he a, graph on n = |U| vertices and m = \E\ edges. Let d < m 
be a positive integer, called the window. Let E = {cq, ei, . . . , Cm-i}- An edge 
ordering of G is a permutation tt of the edge indices {0, 1, . . . , m — 1}. For the 
graph G with edge ordering tt, and window d, define the m + 1 — d graphs 
{G^’‘^ : 0 < i < m — d} hy setting G^’'^ to be the graph containing edges 
. . . , Then, when sets of d edges that are consecutive 

under the ordering tt are accessed, the graphs {G^’’^} represent the possible 
subgraphs accessed. 

In the application to be described, the cost of accessing a subgraph of d con- 
secutive edges is measured by the sum of the number of edges and the number of 
vertices of nonzero degree in the subgraph. Since each has d edges, any reduction 
in access cost results from varying the numbers of vertices. Hence we define n^’'^ 
to be the number of vertices of nonzero degree in The (average) d-access 

cost of graph G under ordering tt is defined to be ^ • The d-maximum 

access cost of graph G under ordering tt is defined to be max™g'^n^’‘^. Maximum 
access cost and average access cost measure different aspects of performance, 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 420-431, 2001. 

© Springer- Verlag Berlin Heidelberg 2001 
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but evidently address similar goals. An ordering is (d, /) -cluttered if it has d- 
maximum access cost equal to /. Let md(G) denote the minimum value of /, 
over all edge orderings tt of G, of the d-maximum access cost. It may happen 
that md{G) = md+i{G) (see [7], for example). We write c/(G) for the maximum 
value of d for which a (d, /)-cluttered ordering of G exists. When G = AT„, we 
use the notation md{n) and Cf{n). 

In this paper, we develop two combinatorial constructions for cluttered or- 
derings of the complete graph The first is for small windows (§3) and the 
second for large windows (§4). Each is limited to specific parameter selections 
for d, /, and n, and hence may not provide the required solution for parameters 
arising in a particular application. However, we derive a powerful extension of 
the combinatorial labelling technique for small windows which dramatically in- 
creases its application. In section 5, we also develop an inflation technique that 
permits us to obtain orderings for specific parameter situations from solutions 
for smaller cases. 

2 An Application to RAID 

Secondary storage devices play an important role in system performance. Im- 
provements in magnetic disk speed have not kept pace with improvements in 
CPU performance. Current systems therefore have the potential to be I/O 
bound. This, coupled with the requirement for high reliability and availabil- 
ity, has resulted in use of redundant arrays of independent disks (RAID); see 
[3,9,11,12,13,14]. RAID maps larger disk reads or writes to multiple smaller 
reads and writes. It splits (stripes) these across several disks in parallel along a 
parity stripe [16]. Redundant information is maintained in order to reconstruct 
the array in the case of disk failure. Parity computations are employed to retain 
the redundant information. 

A catastrophic failure renders a disk unreadable; this is an erasure. A code 
that corrects for t erasures is a t-erasure code. One important metric in erasure 
codes for disk array architecture is the update penalty. This is the number of 
disks whose contents must be changed each time a write occurs on a single disk 
in the array. In a t-erasure code, the minimal update penalty is t. 

In the schemes that we examine, there are c -\- k disks. Within each stripe 
across these disks, k positions are reserved for information while c express parity 
functions of the information. The code used is systematic, because information 
is expressed in an unencoded format. The equations for an erasure code using 
parity for redundancy in disk arrays can be represented as a a parity check 
matrix, H = [P|/]. This is a c x (A: -|- c) matrix where the columns of P are 
indexed by the information disks, the columns of I are indexed by check disks 
and the rows of P and I by the check disks. I is the c x c identity matrix. A disk 
failure is recoverable if and only if the failed columns are linearly independent 
modulo 2 [10]. 

In this setting, for each write to an information disk, we must also update each 
of its check disks. Writes are expensive operations in a disk array. Update penalty 
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is a useful metric, but it fails to describe the observed write penalty. Consider 
writing information to the array. One method is to write to each information 
disk to be updated; then each of the information disks in their equations is read, 
in order to calculate the new parity; and lastly the changed parity is written 
to each of the check disks. This is unnecessarily expensive when writing to a 
small fraction of disks in an array. For large disk arrays with multiple erasure 
protection, when the number of disks to be written is relatively small compared 
with the number of information disks, all writes are implemented as a read- 
modify-write (see [6,11]). To write to a disk, we only need to calculate the change 
to its check disks, before updating them. Therefore we read all of the check and 
information disks involved. Then the change to the parity is computed, the check 
disks are written and the new data is written. For a write involving only one 
information disk in a 2-erasure code, this requires six I/O accesses. The actual 
penalty for an update penalty of t is 2t -|- n accesses where n is the number of 
information disks. For small writes, this is less expensive than reading the entire 
stripe. 

Our previous work on 3-erasures shows that the ordering of the parity check 
matrix may reduce the observed update penalty when writing across three disks 
at a time [5,6]. Since information disks share check disks, we can improve the 
observed penalty if we impose an ordering and assume some intermediate buffer- 
ing or parity logging (see [5,17]). If data is striped sequentially along disks in 
a parity check matrix and these disks share check disks, the actual number of 
reads and writes can be reduced. 

The connection to cluttered edge orderings is as follows. Columns of the 
parity check matrix contain two ‘1’ entries, and hence if we interpret check disks 
as vertices, every information disk is associated with an unordered pair of check 
disks, i.e. with an edge. The parity check matrix therefore represents a graph; 
indeed if it does correct two erasures, the graph has no multiple edges. Therefore 
the maximum number of information disks that can be employed corresponds 
to the selection of the complete graph to form the parity check matrix. If we 
label the first h columns of the parity check matrix with h distinct pairs on c 
vertices and then label the identity portion of the matrix with h + 1 through 
h + c we form a parity check matrix. If we then order the h columns using a 
cluttered ordering of the corresponding graph, and stripe data sequentially, this 
minimizes the maximum number of check disks affected by a write operation. 



3 Wrapped Labellings 

In this section, we develop a technique for producing specific (d, /)-cluttered 
orderings of when / is relatively small. We define a graph G{h, w) on h(r<;-|-l) 
vertices. The vertex set is 14 = ■ 0 < i < h,0 < j < w}. The edge set is 

E = {{vij,Vkj} : 0 < i < k < h,0 < j < w} U {{vij,Vk,j+i} : 0 < k < i < h,0 < 
j < w}. There are w ■ edges; indeed, G{h,w) has a natural edge-partition 
into w copies of G(h, 1). A p-labelling of a graph G = (V,E) is a mapping 
p ■. V ^ 'L 2 \e\+i for which {±{p{r) — p{s)) : {r, s} G E} contains all nonzero 
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Fig. 1. Wrapped p-labelling of G(3, 2) 



numbers in 1:2\e\+1j see [15]. These p-labellings have been extensively studied 
as methods of partitioning (or decomposing) the edge set of the complete graph 
K 2 \e\+i into copies of G [1,15]- Under suitable conditions, they can also be used 
to constrain the graphs induced among all sets of [if | consecutive edges; a simple 
example is given in [7]. We pursue a more general formulation next. 

3.1 Wrapped Labelling of G{h, w) 

A p-labelling of G{h, w) is a wrapped p-labelling if it satisfies {a + p{vio) : 0 < 
i < h} = {p{viw) ■ 0 < i < h} for some a which is relatively prime to 2|if| + 1. 
Figure 1 shows a wrapped /O-labelling for G(3, 2); here a = —2. 

Proposition 1. If G{h,w) has a wrapped p-lahelling, then there is an order- 
ing of the edges of K 2 wh^+i in which every set of hf consecutive edges in- 
duces a subgraph on at most 2h vertices, i.e. a {h^ ,2h)- cluttered ordering. Thus 
m}pi{2whf + 1) < 2/i and C 2 h{‘ 2 -wh^ + 1) > 

Proof. Let a + p be the labelling obtained by adding a to each label assigned 
by p, reducing modulo 2wh? + 1. Starting with (Ghw,p), we can identify the 
vertices {viw} with the vertices {uio} of {Ghw, o + p) so that identified vertices 
have the same label (since the labelling is a wrapped p-labelling). Repeating 
this addition of a, we encounter every translate of the labelling p since a and 
2wh?-\-l are relatively prime. Indeed, in the process, every edge of K 2 wh'^+i arises 
in exactly one of the translated graphs constructed, since every edge difference 
arises exactly once in p. An example, adjoining the first four translates of G(3, 1) 
with wrapped p-labellings p, 1 -I- p, 2 -|- p, and 3 -I- p, is shown in Figure 2. 

Our task is now to order the edges of G{h, 1). We first list the ( 2 ) edges on 
{uio} in any order. Then we start with {t’ooji’oi}; {vio,uoi}, {uio,un}; and in 
general, {ujo, "Coi}) • ■ • ) In other words, we list the edges of the form 

{vio,Vji} so that {vio,Vji} precedes {vko,Vi\\ if and only \i i < k, or i = k 
and j < 1. If, within every copy of G(/i, I), edges are ordered in this manner, 
then we claim that every set of h? consecutive edges induces at most 2h vertices. 
Consider sliding a window of size h? through the first and second copies of 
G(h, 1). Initially there are 2h vertices. The first ( 2 ) transitions move the edges 
on {vio} to the corresponding edges on {uii}; no new vertices are encountered. 
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Fig. 2. First four translates of G(3, 1) 




Fig. 3. First nine windows for G(3,2) 



The next transition drops the only edge still incident at vqo and adds an edge 
to vo2- The next edge introduced involves vq2, so no new vertex is seen. Then 
the last edge incident at uio is dropped, and an edge to Vi 2 added. Carrying this 
on, whenever a new vertex is to be introduced, say Vi 2 , the last remaining edge 
incident with Vio is being dropped. Thus after transitions, the window contains 
the second copy of G(h, 1) in the G(h, w). See Figure 3 for an illustration of the 
first nine windows (arranged horizontally) of the ordering of G(3, 2). Transitions 
into all copies of G(h,l) are effected in this way. When the first G{h,w) is fully 
seen, since the second G{h, w) is attached to it as described above, we can simply 
continue with the same argument. Thus every set of consecutive edges induces 
at most 2h vertices. □ 

Enlarging the window provides a further useful result. Consider t-h? + i^~ 2 ^) 
consecutive edges in the ordering. The argument in the proof establishes that at 
most {t + l)h + X vertices are involved in these edges, and hence: 
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Fig. 4. F(3, 2) and its first translate 



Proposition 2. IfG{h,w) has a wrapped p-labelling, then > 

th^ + (^2 for all t>l. 

We show some h x (w + 1) boxes giving wrapped p-labellings (found by a 
backtracking algorithm) for G{h,w): 




3.2 Wrapped p-Labelling of r{h,w) 

A very similar approach enables us to treat windows which induce an odd number 
of vertices. We form a graph r{h, w) which contains all of the vertices and edges 
of G{h,w), along with one further vertex oo, and further edges {{oo,w^} : 0 < 
i < h,0 < j < w}. r{h, w) has wh{h+l) edges. A wrapped p-labelling of r{h, w) 
is a labelling for which {a+p(oo)}U{a;+p(uio) : 0 < i < h} = {p(oo)}U{p(u™) : 
0 < i < /i} for some a relatively prime to 2wh{h + 1) + 1. 

The edge ordering employed for G{h, w) is extended to r{h, w) by placing all 
edges involving oo at the beginning. The property of being wrapped ensures that 
the vertices {oo}U{uio ■ 0 < i < h} have the same labels in (r{h,w), p) as those 
of {oo} U {viw : 0 < i < hj in {r{h, w), a + p), so that an identification of these 
vertices can be made as for wrapped p-labellings. Figure 4 shows a wrapped 
p-labelling of T(3, 2), adjoined to the first translate with a = —1. The vertex 
labeled 0 is oo in the first T(3, 2), while the vertex labeled 48 is oo in the second. 

Proposition 3. If F{h, w) has a wrapped p-lahelling, then there is an ordering of 
the edges of K 2 wh(h+i)+i in which every set ofh{h-\-l) consecutive edges induces 
a subgraph on at most2h-\-l vertices, i.e. a {h{h-\-l),2h-\-l)-cluttered ordering. 
Thus mf,(h-\-i){‘^njh{h -I- 1) -I- 1) < 2h-\- 1 and C 2 h+i{‘^wh{h -I- 1) -I- 1) > h{h 1). 
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This extends in the same manner as Proposition 2, to establish: 

Proposition 4. If r{h,w) has a wrapped p-lahelling, then C(t+i)h+x+i{‘^wh{h + 
1) + 1) > th{h + 1) + for all t > 1. 

In the wrapped p-labellings to follow, we always take p(oo) = 0. 





Cohen and Colbourn [7] treated a restricted version of this problem, namely 
finding wrapped p-labellings of T(l, w). They settled this problem completely in 
the affirmative, and established more. When d = 3, the access cost for a specific 
subgraph is at least 3, and at most 6. When three consecutive edges induce 
a triangle, we have access cost 3 and this is the only situation accounting for 
this minimum. However, when {e^, Cj+i, 6^+2} induces a triangle, {cj-i, Cj, e^+i} 
cannot form a triangle unless n = 3, for ei_i = e^+2 is necessary if both are 
triangles. Now when three consecutive edges do not form a triangle, the fewest 
vertices induced is four, which can be realized by a path or a star on three edges. 
Hence the minimum 3-access cost of Kn is at least 3.5 when (2) = 0 (mod 2), 
and is at least 3.5 — „2_^„_4 when (”) = 1 (mod 2). 

An edge ordering for AT„ which realizes this minimum when d = 3 is then 
sought. The specific question addressed is: When can the (2) edges of the com- 
plete graph Kn be ordered by a permutation tt, so that among the (2) — 2 
subgraphs at least |(n^ — n — 6) subgraphs form triangles? Such an 

ordering of the edges of is a ladder ordering of pairs. When (”) is even, this 
requires that for any three consecutive edges e^, Cj+i, 6^+2 with 0 < i < (2) ~ 3, 
the subgraph induced by these three edges contains three vertices when i is even, 
and four vertices when i is odd. A wrapped p-labelling of G(l, w) gives a ladder 
ordering of pairs for n = 4w + 1. Cohen and Colbourn [7] prove: 

Theorem 1. A ladder ordering of pairs for Kn exists except possibly when n G 
{15,18,22}. 

This has an immediate application to the determination of maximum access 
cost: 

Proposition 5. There is an edge ordering of Kn with (2 f — 4) -maximum access 
cost less than or equal to f. Hence, with the stated exceptions for n, the largest 
d for which rrid{n) = f is at least 2/ — 4, i.e. Cf{n) > 2/ — 4. 
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Proof. A ladder ordering of AT„ employs at most / vertices among any 2/ — 4 
consecutive edges, for all /. It remains only to treat the case when / = 3, which 
is handled in [7]. □ 

We next develop a simple lower bound for c/(n). 

Proposition 6. For / > 3, c/(n) < {^ 2 ^) + 1- Hence 03(71) = 2 and 04(77) = 4. 

Proof. Consider a {d, /)-cluttered ordering tt on G = AT„. It must happen that 
for some value of i, and are on different sets of / vertices. But G^'^ 
has at most one edge that does not appear in G^J^^ , and hence contains a vertex 
of degree one. It follows that Gj’‘^ can contain at most (^2 + 1 edges. □ 

This can be refined to prove that 05(77) = 6, as follows. If an ordering tt of K„ 
had 7-maximum access cost equal to five, then by the argument given, we can 
assume that G^’‘^ and G^J^i share a K 4 . Upon considering the vertices of G^^^ 
for r = 2,3,4, since no new vertices can be introduced, we find that the three 
edges e,r(i-i-2) Etnd e,r(i-i-3) share a common vertex which does not appear 

in Gf+l (otherwise, the vertex y added in forming has become saturated 

and no more edges can be placed). 

A more involved argument establishes that 05(77) < 9, but a general bound 
is elusive. 

3.3 A Generalization 

There is no wrapped p-labelling of G(/i, 1) for h G {6, 7, 8, 9}. However, removing 
the edge {050,051) from Go,i, we can find a wrapped p-labelling of the resulting 
graph, with p(ooo) = p(on) - 1 = 27, p(oio) = p(o2i) - 1 = 18, p(o2o) = 
p(03i) - 1 = 41, p(o3o) = p(04i) -1 = 7, p(o4o) = p(o5i) -1 = 3, and p(o5o) = 
p(ooi) — 1 = 0. By the reasoning of Proposition 1, a (35, 12)-cluttered ordering 
exists for Kyi. To generalize, suppose that we are to form an (e, 2f)-cluttered 
ordering of A2e+i and that e < We could begin with G{£, 1), and preserve 
any e of the edges. Then if the resulting graph has a wrapped p-labelling, the 
ordering is easily generated. The only potential difficulty here is in determining 
which e edges to retain. Instead, we can label the vertices of G{£, 1), not removing 
any edges, so that every nonzero difference modulo 2e -I- 1 is represented as 
the edge difference on at least one edge; call this a wrapped near-p-labelling of 
G{£, 1) modulo 2e -|- 1. If a wrapped near-p-labelling exists, we can omit edges 
one by one whose difference is multiply represented, until every difference is 
represented exactly once. Thus we are asking, in general, for wrapped near-p- 
labellings modulo v for v < 2£'^ + 1. Indeed, there is no requirement that the 
modulus be odd; when it is even, an edge difference of half the modulus generates 
two copies of each edge, and hence half the edges of this difference can be omitted 
and the other half retained. A wrapped near-p-labelling modulo 2e -I- 2 yields 
a cluttered ordering for e edges. Space does not permit a full exploration here, 
so we content ourselves with some examples. G(7, 1) has no wrapped p-labelling 
(modulo 99), but we have found wrapped near-p-labellings for every odd modulus 
from 51 to 93 inclusive, and for every even modulus from 52 to 88 inclusive. This 
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establishes that for 25 < e < 46, there is an (e, 14)-cluttered ordering of K 2 e+i, 
and for 25 < e < 43, there is an (e, 14)-cluttered ordering of K 2 e+ 2 - 

The labellings all have a similar structure, with p{vi+i^i) — 1 = p(vifi) for all 
i, and p(u 6 ,o) = 0- We therefore present only the labels on {xio} for 0 < i < 5 
in the following: 



V 


p{vio) 


V 


p{vio) 


V 


p{vio) 


V 


P(Vio) 


52 


2 


4 6 10 37 25 


53 


2 


4 


6 8 12 38 


54 


2 


4 


6 11 26 39 


55 


2 4 


6 8 15 26 


56 


2 


4 6 11 40 27 


57 


2 


4 


6 8 16 27 


58 


2 


4 


6 12 28 42 


59 


2 4 


6 8 17 28 


60 


2 


4 6 12 43 29 


61 


2 


4 


6 8 18 29 


62 


2 


4 


6 13 30 45 


63 


2 4 


6 8 19 30 


64 


2 


4 6 13 46 31 


65 


2 


4 


6 8 19 41 


66 


2 


4 


6 14 32 48 


67 


2 4 


6 8 21 32 


68 


2 


4 6 14 49 33 


69 


2 


4 


6 11 33 56 


70 


2 


4 


6 15 34 51 


71 


2 4 


6 11 57 34 


72 


2 


4 6 15 52 35 


73 


2 


4 


6 12 59 35 


74 


2 


4 


7 17 54 44 


75 


2 4 


6 15 26 44 


76 


2 


4 13 20 43 51 


77 


2 


4 


6 17 26 46 


78 


2 


4 13 44 53 60 


79 


2 4 


7 26 36 67 


80 


2 


4 14 21 45 54 


81 


2 


4 


8 27 69 38 


82 


2 


4 46 55 15 22 


83 


2 4 


8 38 70 28 


84 


2 22 5 15 57 47 


85 


2 


4 


8 28 39 72 


86 


8 28 71 3 51 41 


87 


2 4 33 71 11 47 


88 


10 13 18 62 34 57 


89 


2 


6 18 36 27 50 


91 


2 


9 30 68 14 49 


93 


2 9 14 31 69 50 



This strategy extends naturally to G{h,w) and to r{h,w); when ru > 1, we 
insist that when edge Vj^k+i\ is removed, all edges of the form {vi/, 
for 0 < £ < w are removed, in order to ensure that the resulting ordering is 
(e, 2h)~ or (e, 2h + l)~cluttered. 



4 Large Windows 



In this section, we examine the opposite extreme, when all but one vertex is 
induced within a window. An upper bound is easily proved: 

Proposition 7. c„_i(n) < [( 2 ) 72 ] — 1. 

Proof. We must establish that when d = [( 2 ) 72 ], every ordering tt has d- 
maximum access cost equal to n. Suppose to the contrary that there is an or- 
dering 7 T with cost less than n. Let G = Kn, and consider Gg’”^ and 

These graphs must have a different isolated vertex, since at most one edge of 
K„ fails to appear in either; suppose that 0 and 1 are, respectively, the missing 
vertices. When (”) is even, all edges appear in one of the two graphs and yet 
{0,1} appears in neither, which is a contradiction. So it remains to examine 
the case when ( 2 ) is odd. Then (0, 1} must be graph G^' 

for 1 < z < d contains the edge { 0 , 1 } and hence must miss a different vertex. 
Suppose that G{’ is missing vertex x, and G^’ is missing y. It cannot happen 
that X = 0 , or all occurrences of 0 he among the last d positions and hence all 
vertices appear there as well. By the same token, we hnd that {x, z/| fljO, 1} = 0. 
Since Gg’*^ misses 0 and G^’*^ misses y, the last edge in the ordering must be 
{0,z/|; in the same way, the hrst must be |l,x}. If {x,y} is an edge, there is 
then no position in which to place it in the ordering. Therefore x = y. But then 
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since and G^’'^ both miss x, the total number of occurrences of x in edges 
is at most two, the final contradiction. □ 

Now we prove a matching upper bound. 

Proposition 8. c„_i(n) > [( 2 ) 72 ] — 1. 

Proof. We define an ordering tt on the edges of iC„. Let 0, 1, 2, and 3 be specific 
vertices of and let A be the set of remaining n — 4 vertices. Edges are ordered 
as follows: {2,3}, {0,2}, {0,3}; {3, a} for a G A; {0,a} for a G A; X; {0,1}; Y; 
{l,a} for a G A; {2, a} for a G A] {1,2}, {1,3}. Here X and Y partition the 
remaining edges arbitrarily, subject to the condition that Y contains at least as 
many edges as does X, and contains at most one more edge. It is then easily 
verified that the graphs G^’"^ with d = [( 2 ) 72 ] — 1 miss first vertex 1 , then 2 , 
then 3, and finally 0. □ 

For n = 8, here is an example ordering: 23, 02, 03, 34, 35, 36, 37, 04, 05, 06, 
07, 45, 46, 47, 01, 56, 57, 67, 17, 16, 15, 14, 27, 26, 25, 24, 12, 13. All sets of 13 
consecutive edges only involve seven of the vertices. In fact, the only set of 14 
consecutive edges involving all eight vertices is the last. 

5 Inflation 

While the techniques of the two preceding sections often give useful cluttered 
orderings for the RAID application, they can be difficult to apply to specific 
values of n and d. We therefore develop a simple technique, inflation, for pro- 
viding reasonable orderings in cases where our earlier techniques do not apply 
directly. Suppose we wish to produce an edge ordering for AT„, and we have 
{d, /)-cluttered orderings for a collection of smaller complete graphs. If there is 
a (d, /)-cluttered ordering of Kt for t < n, write n = st + r, with 0 < r < t. 
Form a complete graph K„, and partition its vertices into t classes, each having 
size s or s -I- 1. Call the classes Xi, . . . , Xs- For each 1 < i < s, the edges within 
class Xi are partitioned into t — 1 classes, Aq, . . . , /yt-i, so that the number of 
edges within the edge classes are as equal as possible. Then each has at least 
— l)\edges. Now form a (d, /)-cluttered ordering of Kt on vertex set 
{1, . . . ,t}. We order the edges of AT„ as follows. For each edge {i,j} of Kt in 
turn, we list all edges of K„ with one endpoint in Xi and the other in Xj (in 
any order). Then if this is the ^th occurrence of i in an edge, we list the edges 
in li^i. We repeat this for j. The resulting ordering of AT„ has ‘inflated’ every 
edge of Kt into a collection of at least -I- 2[(2) /{t — 1)J edges. Hence we have 
proved: 

Proposition 9. If a {d, f) -cluttered ordering of Kt exists, a {d' , f) -cluttered 
ordering of Kgt+r <ilso exists, where /' = min{sf f,sf -\- r) and d' > (s^ -I- 

2L©7(t-i)J)-d. 

A very special case arises when n = t -\- 1. Then we find that Cf(n) > 
c/_i(n-l). 
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We do not expect inflation to yield edge orderings which actually minimize 
the number of vertices induced. Rather, it affords us a method for employing 
the specific solutions found by combinatorial methods (as in earlier sections) to 
produce acceptable orderings for a large range of relevant parameters. 

6 Concluding Remarks 

The impact of improved edge orderings on performance is treated in [8] . In this 
paper, we have focussed on combinatorial methods for producing cluttered order- 
ings. We extended the notion of ladder orderings (which handle very small reads) 
to larger reads by establishing the existence of numerous wrapped labellings. The 
specific examples produced are of value in erasure codes of practical size. We also 
established an exact result when all but one of the vertices are induced within a 
window. Finally, we described a simple technique for employing existing order- 
ings for small complete graphs to develop orderings for larger complete graphs; 
again, this can be expected to have practical value in finding reasonable (but 
almost certainly not optimal) orderings for particular parameters in a RAID 
system. 

We have treated only the complete graph here, and it is natural to ask about 
orderings of other graphs as well. In the RAID application, we expect that sim- 
ply removing the trailing edges from an ordering provides an erasure code with 
fewer information disks that enjoys similar performance benefits. However, con- 
siderations such as reliability and balanced ordering (see [10]) may argue against 
the omission of those particular edges; hence it may be of interest to extend the 
techniques here to handle a broader class of graphs. 
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Abstract. Stream merging is a promising technique for reducing server 
bandwidth in video-on-demand systems. There are many heuristics for 
the problem proposed whose effectiveness has been conhrmed empirically. 
However, it is desirable to prove their effectiveness mathematically. In 
the pioneering work [2], Bar-Noy and Ladner studied stream merging 
using competitive analysis. They designed an 0(log n)-competitive on- 
line scheduler, where n is the total number of stream requests. However, 
their result is applicable only to systems with large client bandwidth 
and buffer size. In this paper we design the first on-line scheduler for 
stream merging in the general setting, in which we lift the large resource 
requirements, and our scheduler achieves constant competitive ratio. 



1 Introduction 

We study an on-line scheduling problem arising from video-on-demand (VOD) 
systems that support stream merging. A VOD system often receives many re- 
quests for a hot video over a short period of time (say, Friday 7 pm to 9 pm). If 
the video server uses a dedicated video stream to serve each request, the band- 
width requirement for the server becomes enormous. A simple alternative is to 
batch [1,5] the requests so that a single stream is issued to serve requests arrived 
over a period of time; however, this increases the response time substantially. 

To reduce the bandwidth requirement without sacrificing the response time. 
Eager et al. [6,7] proposed a promising technique called stream merging, which 
assumes that a client is equipped with extra buffer and can receive data from two 
streams simultaneously. The technique takes advantage of the assumptions as 
follows. Initially, a client receives data from a dedicated stream X for immediate 
playback. After some time, the client may start to buffer data simultaneously 
from another stream Y initiated earlier (due to another client). Suppose Y is 
initiated t time units before X. Then after listening to both X and Y for t time 
units, the client no longer needs X as it can play back the next t time units 
using the data in the buffer, while further buffering data from Y simultaneously. 
Hence, X may terminate now. We say that X merges with Y when the client 
starts to count on Y only. Merging streams allows several clients to be served 
using only one full stream plus a number of partial streams, thus saving the 
bandwidth of the server. Figure 1 shows an example. 

* Research supported by the Hong Kong Research Grants Council HKU7103/99E. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 432-442, 2001. 
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Fig. 1. (a) At time 0, client B is served by stream Y. (b) At time 1, client A 
arrives. It is served by stream X; it simultaneously buffers data from Y. (c) 
At time 2, A plays the content of its buffer and continues buffering from Y. X 
terminates. Assume the video length is 120 time units. The required bandwidth 
is 121 instead of 240. 



An on-line scheduler for stream merging determines whether each existing 
stream should continue to run or merge with another earlier stream without 
the knowledge about the requests not yet arrived. The goal is to minimize the 
total bandwidth used, i.e., the total duration of all streams. Such schedulers 
were first studied in a primitive form in [3,9,10], in which a stream is allowed to 
merge with a full stream only. Eager et al. [6-8] gave a general on-line scheduler 
that performs well empirically. Bar-Noy and Ladner [2] was the first to study 
the problem using competitive analysis. An on-line scheduler is said to be c- 
competitive, or has competitive ratio at most c, if for any sequence I of requests, 
it uses at most c times of the bandwidth used by the optimal off-line schedule in 
serving I. In [2], Bar-Noy and Laner gave an on-line scheduler with competitive 
ratio at most minjlog^ n, log^ -I- 0(1), where (j) = {1 + -\/5)/2, and O < 1 is 
the guaranteed startup delay measured as a fraction of the time for a full stream. 
In [4], we gave the first on-line scheduler with constant competitive ratio. 

All the above-mentioned results assume that the total client bandwidth is at 
least twice the playback bandwidth (half of it is for normal playback and the 
additional half is for buffering an earlier stream) and the client buffer is sufficient 
to store up to half the size of a video. Eager et al. [8] pointed out that in reality, 
clients may have limited bandwidth and buffer. For the general problem-the 
(w, ,B)-stream merging problem, in which the client additional bandwidth is only 
a fraction oj (0 < w < 1) of the playback bandwidth and the client can buffer 
at most B {B > 0) time units of video, only empirical results are presented (see 
[8]). In [2], Bar-Noy and Ladner leaved as an open problem whether the requests 
can be handled competitively in this general setting. 

In this paper we give the first on-line scheduler for (w, ,B)-stream merging 
with constant competitive ratio (precisely, 5) for any given 0 < w < 1 and any 
B > 0, and thus solving the open problem of Bar-Noy and Ladner. Note that the 
competitiveness of our scheduler does not deteriorate even if the bandwidth and 
buffer are limited. Our result is based on a novel observation that the behavior 
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of any scheduler can be modeled by rectilinear (binary) trees on a triangular 
grid, and we will give the details in Section 2. In Section 3 we present a general 
methodology for constructing a competitive on-line scheduler from some near- 
optimal rectilinear tree. In Section 4 we show how to construct such near-optimal 
rectilinear tree. 



2 Concepts and Notations 

2.1 Preliminaries 

In our discussion, time is divided into intervals of unit length, which can be 
arbitrarily small. Requests arriving within the same interval are treated as a 
single request and they are served by multicasting one stream. We suppose both 
video length C and buffer size B are measured in terms of time units. 

Note that with the additional bandwidth being only a fraction ui of the 
playback bandwidth, it takes longer time for a stream X to merge with a stream 
Y initiated t time units earlier. It is not diffcult to see that X takes time 

units to buffer data from Y before the two streams can merge (see [8]). We call 
C = |"l/w] the catch up ratio. Observe that for X merging to Y successfully, 
the buffer size B could be equal to t, and since X must merge with Y before Y 
terminates, we require t + Ct < L, or equivalently t < £/(l + C). Thus, B can be 
as small as t < £/(l-|-C). On the other hand, if B is smaller than £/(l-|-C), the 
optimal off-line schedule has only limited choices to merge the streams, and an 
on-line scheduler can simulate the off-line schedule easily (details given in the 
full paper). Thus, we only need to consider the case where B — £/{l+C). 

A request sequence is said to be compact if the last request arrives within B 
time units from the first request. Notice that such a sequence contains at most 
B + \ requests. Intuitively, all streams initiated for a compact request sequence 
can merge successfully before the first stream terminates, and the first stream 
initiated must be a full stream. In fact, using an observation of Bar-Noy and 
Ladner [2], we can show that for compact request sequences, it suffices to focus 
on schedules with the following property: 

The first stream is the only full stream and all other streams merge with 

other streams eventually. 

Since the first stream always contributes a dominating portion in the overall 
bandwidth usage of any schedule, the overall bandwidth of different schedules 
may not differ a lot. To make a more meaningful comparison between different 
schedulers, we define the full cost to be the bandwidth used by all the streams, 
and define the merge cost to be the bandwidth used by all except the first stream. 
Note that the full cost and merge cost differ by exactly the video length £. 

Compact request sequence is a key notion for stream merging because a gen- 
eral request sequence can be processed competitively if each consecutive compact 
request sequence can be processed competitively. More precisely, suppose A is 
an on-line scheduler for the problem such that with respect to compact request 
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sequences, its merge cost is at most c times that of the optimal off-line sched- 
ule. We can use A to process arbitrary request sequence as follows. As requests 
arrive, we divide them into groups. When a request arrives within B time units 
from the first request of the current group, it is inserted to this group, and we 
use A to schedule a stream for it and determine how the stream merges with 
other streams. Otherwise, we start a new group and initiate a full stream for 
this request. Notice that each group of requests is in fact a compact request 
sequence. The following theorem, which generalizes the one in [2], asserts that 
the bandwidth used by the resulting schedule is at most c + C!{C — B) times 
that of the optimal off-line schedule. The proof will be given in the full paper. 

Theorem 1. Let A he an on-line scheduler such that with respect to compact 
request sequence, its merge cost is at most c times that of the optimal off-line 
schedule. Then for any arbitrarily long request sequence, using A to schedule 
separately each compact request sequence can produce a schedule with an overall 
bandwidth at most c -\- £/(£ — B) times that of the optimal off-line schedule. 

In the rest of this paper, we focus on compact request sequences and give an 
on-line scheduler whose merge cost is at most three times that of the optimal 
off-line schedule. Then, by Theorem 1, the competitive ratio of this scheduler 
is at most i + C/{C — B) = 5 because C > 1, and we are consider the case 
B = C/{1 -\-C). Unless specified otherwise, all request sequences mentioned are 
meant to be compact request sequences. To ease discussion, we assume that the 
first request of all compact request sequences arrives at time 0. 

2.2 Binary Merge Trees 

In this section we show how to model schedules using binary merge trees, which 
attempt to model the occurrences of merging and the status of each stream.^ 
Below, we use the notation r(s) to denote the time when a stream s is initiated. 

A binary merge tree is a rooted, ordered binary tree. For any node with 
(out-)degree one, we require the node to have a left son but no right son. Every 
node u is associated with a stream s„ and a time In particular, every leaf (. 
corresponds to a distinct stream that is initiated due to a request, and ti = r{si). 

For an internal node u, its associated time is always greater than that of its 
sons, and its stream is always inherited from its left son. If u is of degree two, 
u and its right son, denoted by v, represent the occurrence of a merging. We 
require that r(s„) > r(su) and = t„-|-C(r(s„) — r(s„)). At time the stream 
Sy (precisely, the clients served by s„) starts to buffer the content of s„; after 
C(r(sy) — r(sy)) time units (i.e., at time t„), merges with Sy and terminates. 

Regardless of whether u is degree one or two, u and its left son w corresponds 
to a continuation of the same stream from time ty, to time ty . That is, the stream 
has no plan to merge with another earlier stream during this interval. Note that 
the root as well as all the nodes on the leftmost path of the tree are associated 

^ This is in contrast to the merge trees used by Bar-Noy and Ladner [2], which model 
the relationship among the streams. 
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with the same stream, which does not merge with any stream and runs till the 
end. The time associated with the root is exactly C. 

The cost of an edge (u,v) is defined as tu — ty. The full cost of a binary 
merge tree is the sum of the cost over all edges, and the merge cost is the full 
cost minus the cost of the edges on the leftmost path. 

Recall that we focus on compact request sequences and schedules in which all 
streams merge with the first one. Note that each binary merge tree corresponds 
to one of these schedules with the same full cost and merge cost, and vice versa. 
Thus, for any compact request sequence, there is a binary merge tree with full 
cost and merge cost equal those of the optimal off-line schedule. Below, we show 
that the optimal binary merge tree, the tree associated with an optimal off-line 
schedule, can be assumed to have a more uniform structure. 

Lemma 1. Consider any compact request sequence I. There is an optimal bi- 
nary merge tree for I such that all the internal nodes, except possibly the root, 
have both left and right sons. 

Proof. Let T be the optimal binary merge tree with the minimum number of 
nodes. Assume there is a degree one internal node v. By definition, v has a left 
son, say w. Let u be the parent of v. If v is the left son of u, we remove v and add 
an edge between u and w. The cost is not changed; contradicting that T has the 
minimum number of nodes. If v is the right son of u, Sy starts buffering from Sy 
at ty. We make Sy starts buffering from s„ earlier, say at ty,. This decreases the 
merge cost of the schedule and contradicts the optimality of T. □ 



2.3 Rectilinear Trees 

This section discusses a powerful tool called rectilinear trees to model schedules 
of stream merging. The new tool is more useful for visualizing the difference 
between an arbitrary schedule and the optimal off-line schedule. Roughly speak- 
ing, rectilinear trees are binary merge trees embedded on a triangular grid. Let 
K = B -\- 1. Consider a K x K grid with K grid points (i.e., with K — 1 grid 
arcs) on each boundary. The grid is distorted into a rectangle: The horizontal 
grid arc has a length of (1 -I- C) and the vertical grid arc a length of C. 

A grid line is a sequence of consecutive horizontal (or vertical) grid arcs. 
A grid path is a sequence of grid arcs connecting any two grid points. The rows 
and columns of the grid are numbered 0 to iL — 1 from top to bottom, and 
from left to right, respectively. The triangular grid in concern is the upper right 
triangle with grid points (0, 0), (1, 1), • • • ,{K — 1,K — 1) as the hypotenuse. 

A rectilinear tree is a binary tree on the triangular grid satisfying the follow- 
ing properties. (1) The root is at the top rightmost corner of the triangular grid 
and the leaves are on the hypotenuse of the triangular grid. (2) Any node and 
its parent are either on the same row or on the same column, i.e., any edge is 
either a horizontal or a vertical grid line. (3) The grid path from any leaf to the 
root is monotone — The rectilinear distance to the root is strictly decreasing as 
we traverse from the leaf to the root. 
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For any rectilinear tree T, we define the length of an edge to be the length 
of the corresponding horizontal or vertical grid line. The full length of T is the 
total length of all its edges, and the branch length is the total length of all edges 
except those edges along the leftmost path of the tree. Notice that two rectilinear 
trees with the same set of grid arcs have the same branch length even if the set 
of nodes may differ. Branch length is a notion heavily used below; to simplify 
our discussion, we denote by 6{T) the branch length of T. The following theorem 
shows the correspondence between binary merge trees and rectilinear trees. 

Theorem 2. Any binary merge tree can be embedded on a triangular grid as a 
rectilinear tree with branch length equals the merge cost of the binary merge tree. 

Proof. Let T be a binary merge tree. For any node u in T, we create a node ur at 
the grid point (r(s„), (t„-|-Cr(s„))/(l-|-C)). If u is a leaf, ur is on the hypotenuse. 
The root becomes a node at the upper rightmost corner of the grid. If u is an 
internal node, let w be the left son of u. wr is at (r{s^), {t^ -|-Cr(su,))/(1 -fC)). 
As Su is the same as Sw and tu > t^, ur lies on the right of wr on the same row. 
Thus, the edge (u,w) becomes a horizontal grid line between ur and wr, with 
length (1 -l-C) X (tu + Cr{su) — — Cr{sw))/{1 +C) = — t^, which equals the 

cost of the edge (u, w) in T. The case for the right son can be proved similarly. 
The branch length of the rectilinear tree equals the merge cost of T. □ 

3 The On-line Scheduler 

A crucial issue in designing on-line scheduler is to estimate the cost of the opti- 
mal solution. We give a simple way to estimate the cost of the optimal solutions 
of all compact request sequences using only one rectilinear tree, called the com- 
plete rectilinear tree. This rectilinear tree when restricted to any compact request 
sequence / gives an attainable estimate of the branch length of the optimal rec- 
tilinear tree for I . Our on-line scheduler makes reference to this tree to schedule 
the merging of streams; the resulting schedule corresponds to a rectilinear tree 
with branch length always upper bounded by the estimated branch length of the 
optimal rectilinear tree. 

Define a complete rectilinear tree T to be a rectilinear tree whose leaves 
comprise all the grid points along the hypotenuse of the triangular grid. For 
any compact request sequence I, for simplicity, / also denotes the subset of grid 
points along the hypotenuse corresponding to the requests. The rectilinear tree T 
when restricted to I is called the partial rectilinear tree of T with respect to I. 
It is a rectilinear tree denoted by T\\i whose set of leaves is I and set of grid 
arcs is a subset of that of T. In other words, for any grid arc e in T, if e is along 
a path of T from the root to some leaves in I, e is in T\\i] otherwise, e is not 
in T\\i. Note that T||/ is unique for any given /. 

In this section, we describe a methodology which makes reference of a given 
complete rectilinear tree T to derive an on-line algorithm At. For any compact 
request sequence I, At gives a schedule whose corresponding rectilinear tree, 
denoted by At{I), has a branch length at most that of T\\i. That means for 
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any given T and / we can produce a schedule with merge cost at most S(T\\i). 
On the other hand, there is an optimal binary merge tree for / satisfying the 
property stated in Lemma 1. By Theorem 2, this tree can be embedded on the 
triangular grid as a rectilinear tree, denoted by OPTj. In Section 4 we describe a 
particular complete rectilinear tree S such that <5(5' ||/) < 3S{OPTi), and hence 
the merge cost of the schedule produced by As is at most three times that of the 
optimal off-line schedule. Together with Theorem 1, we have the next theorem. 

Theorem 3. ^5 is 5- competitive. 

In this section we assume T is a chosen complete rectilinear tree. We give 
an on-line scheduler At which constructs a rectilinear tree based on T and a 
compact request sequence. The scheduler makes reference to T to determine how 
the streams serving the requests merge with each other. Details of At are as 
follows. 

When the first request arrives (at time 0), the scheduler initiates a stream 
for this request and this stream will not merge with any stream. The initial 
rectilinear tree contains only two nodes, namely, the root at the top rightmost 
corner and a leaf at the top leftmost corner, of the triangular grid. These two 
nodes are connected by a horizontal grid line. All the grid points along this grid 
line are associated with the first stream. 

When a subsequent request arrives, it is assigned a dedicated stream X im- 
mediately. We also determine when X should begin buffering data from which 
earlier stream. Let (r, r) denote the leaf corresponding to current request where r 
is its arrival time. We locate in T the least common ancestor a of (r, r) and the 
leaf corresponding to the request just before the current request. A new grid 
path is added to the current rectilinear tree as follows. 

If a lies on column r, the new grid path is a vertical grid line running up- 
wards from (r, r). The path stops at the first grid point it intersects with the 
current rectilinear tree. In this case, X immediately buffers data from the stream 
associated with the intersecting grid point. 

If a does not lie on column r, it must lie on the right of column r because of 
the monotone property of the paths in T. Suppose a lies on column Ca- The new 
grid path comprises a horizontal grid line from (r, r) to (r, c^) and a vertical grid 
line from (r,Ca) to the first grid point it intersects with the current rectilinear 
tree on column Ca- In this case, (1 -|-C)(ca — r) time units after X is initiated, 
X starts buffering from the stream associated with the intersecting grid point. 

All the grid points along the new path, except the intersecting point, are 
associated with X. The decision is made every time a request arrives. For any 
compact request sequence /, the next lemma bounds S(At(J^)) by 5(r||/). 

Lemma 2. For any complete rectilinear tree T and compact request sequence I, 
6 {At{I))<S{T\U). 

Proof. (Sketch.) Suppose / has k requests. For 1 < f < A, let A be the first i 
requests in I, and Alp be the rectilinear tree constructed by At for A. We claim 
that Ap is always on the right of T\\j., precisely, for any grid point (r, c) G T|l/., 
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Fig. 2. Suppose C = 1. A 12 x 12 triangular grid G is divided into an 8 x 8 and a 
4x4 triangular grids Gi and G 2 . (a) The complete rectilinear tree S on G. The nn- 
shaded area is the rectangle R with top rightmost and the bottom leftmost corners at 
(0, 11) and (8, 7). (b) An optimal rectilinear tree for a given set of compact request se- 
quence with seven requests, (c) After a recursive step the grid arcs within the rectangle 
(unshaded area) are removed and the grid arcs on the four boundaries are included. 

there exists a grid point (r', c) G .4^ with r' > r. Since the path in T is monotone, 
the grid path added to by At for the i-th request has a length at most 

that of the path added to to form T\\i.. Thus, 6{At(I)) < <5(T’||/) as they 

are the sums of the corresponding path lengths. □ 

4 A Good Complete Rectilinear Tree 

In this section we describe a complete rectilinear tree S on a, K x K triangular 
grid A, which provides a good estimate of all optimal rectilinear trees. Precisely, 
we show that ii(S'|| 7 ) < 3S{OPTi) for all compact request sequences I. 

We construct S recursively. For any integer n, let /(n) = [((1 + C)/(l + 
2C)){n — 1)J -I- 1. The root of S is at (0, AT — 1) of A. All the K grid points along 
the hypotenuse are the leaves of S. If S has only one leaf (i.e., when AT = 1), we 
are done. If S has at least two leaves, it has two subtrees. The left subtree has its 
root at (0, /(AT) — 1) and it contains the top f{K) leaves along the hypotenuse. 
The right subtree has its root at (f{K), AT — 1) and it contains the rest K — f{K) 
leaves. These two subtrees are then constructed recursively, and their roots are 
connected to the root of S' by a horizontal grid line with K — f{K) grid arcs and 
a vertical grid line with /(AT) grid arcs. See Figure 2(a). 

To prove ^(Sll/) < 36{OPTj), we derive a procedure Transform that trans- 
forms OPTi into a rectilinear tree 0/ that uses only grid arcs of S, and whose 
set of leaves containing I. Transform is conservative in replacing the edges 
of OPTj so that S{Oi) < 3S{OPTi). On the other hand, S\\i contains the mini- 
mal subset of grid arcs in S to form a rectilinear tree with / as leaves and using 
grid arcs of S only. Thus, i5(S'||7) < 6(Oj), and 5(S'||7) < S{Oj) < 3S{OPTj). 

4.1 The 'Transformation 

Transform is recursive; it takes two parameters G and T. G is a triangular 
grid, and T is a rectilinear tree embedded on G whose root is at the top rightmost 
corner of G. Initially, G = A and T = OPTj. In the first step of Transform, 
consider the rectangle R whose top rightmost corner is at (0, A" — 1) and bottom 
leftmost corner is at (/(A'), /(AT) — 1).^ See Figure 2(a). Note that S does not 

^ R, excluding its bottom leftmost corner, is enclosed entirely in A. 
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cross R; more precisely, no path in S enters R from a boundary of R, goes through 
at least one of its interior grid points, and leaves from a distinct boundary. Thus, 
we must remove all grid arcs of OPTj within R. 

If OPTj has a path crossing R from the bottom boundary, we expand OPTj 
to include all the arcs on the bottom and right boundaries of R, except the 
leftmost one (which is outside A). If OPTj has a path crossing R from the 
left boundary, we include arcs on the left and top boundaries of R, except the 
bottommost one. We then remove all grid arcs that are within R. See Figure 2(b) 
and (c). New nodes are created on any new intersections of grid arcs introduced. 

Let Tq be the new tree formed after the above step. Ignoring the interior grid 
points of R, we partition A into two disjoint triangular grids: an f{K) x f{K) 
one Gi to the left of R and a, {K — f{K)) x {K — f{K)) one G 2 below R. See 
Figure 2(c). Consider the part of Tq in G\. All paths in Tq starting from the 
leaves in G\ to the root of Tq exit G\ through its top rightmost corner. If this 
corner is not a node of To, we add a node there. Then, the part of To in G\ forms 
a tree Tj that is embedded on Gj with the root at its top rightmost corner. If Gj 
is at least 3x3 and Ti contains at least a leaf, we recursively call Transform 
on Gi and Tj to form a tree T{ that uses only grid arcs of S and retains the 
corresponding leaves. We handle the part of Tq in G 2 similarly, and let T 2 be the 
resulting tree. Note that T{ and T 2 , together with the tree edges of To on the 
boundaries of R, form Oj. 

4.2 Analysis of Procedure Transform 

In this section we prove that S{Oj) < 3S{OPTj). Let Ea and Er be the set of 
grid arcs added and removed by calling Transform on G = Z\ and T = OPTj. 
For any set of grid arcs E, we denote by \E\ the total length of grid arcs in E. We 
first show that \Ea\ < 3|£’r|. Note that it suffices to show that in each recursive 
step (excluding the recursive calls), the total length of grid arcs added is at most 
three times that removed. Since the analysis is the same for every recursive 
step, we only consider the first step, in which the grid arcs in OPTj within R 
are removed, where R is the rectangle whose top rightmost and bottom leftmost 
corners are at (0, AT — 1) and {f{K),f{K) — l) respectively. Let £a and Sr denote 
the set of grid arcs removed and added in this particular step respectively. 

Lemma 3. If OPTj crosses R, OPT/ contains a path crossing R vertically or 
horizontally, i.e., the path passes through a single column or a single row of R. 

Proof. Suppose OPTj has a path a from a leaf to the root that crosses R. a ex- 
its R through either the right or top boundary. If a exits through the right 
boundary, say on row i, a or another path in OPTj must go through R horizon- 
tally on row i. Otherwise, there is a node in OPTj having at most one son. This 
contradicts Lemma 1 and Theorem 2. The other case can be proved similarly. □ 

If OPTj does not cross R, obviously \£a\ <3 \£r\] otherwise, we have two cases 
depending on whether OPTj has a path crossing R vertically or horizontally. 

Let r{K) and t{K) denote the length of the right and top boundaries of R 
respectively. In other words, r{K) = Cf{K) and t{K) = (I +C){K — f{K)). 




Improved On-Line Stream Merging: From a Restricted to a General Setting 441 



Lemma 4. If OPT/ contains a path crossing R vertically, \£r\ > r{K) and 

— |'£’r| < r{K)+t{K) — l—2C. //OPT/ contains a path crossing R horizontally, 
\£r\ > t{K) \£a\ ~ |'£’r| < ''’(//) -|- t{K) — 1 — 2C . 

Proof. We only prove the first scenario, and the second one can be proved sym- 
metrically. Let i/top and i/in be the sets of horizontal grid arcs in OPTj that are 
on the top boundary of R and inside R, respectively. Thus, \£r\ > r{K) + |i/in|. 

Suppose OPT[ has a path crossing R from the left boundary at grid point p. 
Transform expands OPTj to include all four boundaries, excluding the two 
arcs adjacent to the bottom leftmost corner. Thus, \£a\ < 2r{K)+t{K) + {t{K) — 
l^f^topl) — 1 — 2.C. Consider the path a from p to the root of OPTj. The total 
length of horizontal grid arcs in cr is at least t{K), and is at most |i/top| + |L^in|- 
This implies t{K) < |i/top| + |L/in|. Therefore, \£a\ — \£r\ < 2r{K) + t{K) + 
ft{K) - |i/top|) - 1 - 2C - (r(/f) -h |i/in|) < r{K) + t{K) -1-2C. The case for 
OPTj not crossing R from the left boundary can be proved easily. □ 

By simple arithmetic, we have r{K) > t{K) — 1 — C and t{K) > r{K) —C. By 
Lemma 4, \£a\ < 3 \£r\- Thus, \Ea\ < 3 \Er\. Since Ea and Er are disjoint, \Oj\ = 
\OPTj\ + \Ea\ - \Er\. Note that S{OPTj) = \OPTj\ - £ and S{Oj) = \Oj\ - £, 
where £ is the length of the top boundary of A. Therefore, 5{Oj) = 6{OPTj) + 
\Ea\ - \Er\. Since \Er\ < 5{OPTj), 5{Oj) < 5{OPTj) + 2\Er\ < 3S{OPTj). 

Lemma 5. 6{Oj) < 3<5(OPT/). 

Recall that ^(<S'||/) < 6{Oj). By Lemmas 2 and 5, we have 5(^s(/)) < 
'5(«5'||/) < <5(0/) < 3 6 (OPTj) and we have the following theorem. 

Theorem 4. Given any compact request sequence I, <5(^g(/)) < 3i5(OPT/). 
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Abstract. We consider an on-line non-preemptive deadline scheduling 
on multiple resources. Each job has the slack time during which it can 
wait until its expiration time, the last time it can be started to be com- 
pleted by the deadline. There are a number of resources on which the jobs 
are allocated. The effect of slack times on the on-line deadline scheduling 
was investigated in [8] by analyzing the performance of the Earliest EX- 
piration Time First (EXE) algorithm. But most of the previous work is 
restricted to a single resource, and in this paper, we propose a different 
method of analysis which can also be applied to multiple resources. So 
we can extend the result of [8] to multiple resources. 



1 Introduction 

Consider the following problem: we are given m resources and a set of job re- 
quests, where each job can be served by at most one of the resources and has 
the processing time for which it uses the resource and the deadline by which it 
should be completed, and each resource serves only one job at any time. Not all 
jobs are required to be scheduled and some jobs can be rejected but they cannot 
contribute to the throughput. Whenever a resource is idle, scheduling algorithms 
should determine which job to be processed on the resource so that it maximizes 
the total of gains of accepted jobs. 

The schedule is non-preemptive, that is, once a job is scheduled, it cannot be 
preempted or rejected and it is on-line, where the algorithm has no knowledge of 
jobs until they arrive and when they arrive, the algorithm is informed the pro- 
cessing times and deadlines of the jobs. The performance of an on-line algorithm 
is estimated through competitive analysis [9, 12], in which it is compared with 
that of an off-line optimal algorithm that knows all informations of the future 
requests. 

In this paper, the problem is described in terms of machine scheduling. Each 
job Ji has the release time ri, the processing time pi, and the deadline di and 
there are m identical machines on which jobs are scheduled. The expiration time 
Xi of a job Ji is defined to be the last time at which Ji can be scheduled to 
completion by the deadline, that is, di = Xi -\- pi and the slack time Si of Ji 
to be Xi — ri- If the slack time of a job is more than zero, before scheduling 
it, an on-line scheduling algorithm has a possibility of waiting for better jobs 
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which will arrive. Assume that for each job Ji, its slack time satisfies Si > npi 
for some constant k. Depending on the value of k, we will investigate the effect 
of the slack time on the performance of the on-line scheduling algorithm. In [8], 
Goldwasser proposed the Earliest EXpiration Time First (EXE) algorithm that 
schedules the job with the smallest expiration time among all available jobs at 
all times. His work is restricted to a single machine, as in the previous results [7] 
and [10]. In [7] and [8], the authors divide the time busy periods during which 
there is no idle time on the machine in the schedule of EXE. Since jobs scheduled 
within a busy period have no effect on the schedules of the other busy periods, it 
is sufficient to analyze the performance of EXE only in a busy period. But this 
cannot be applied to multiple machines, where busy periods on distinct machines 
are aligned to cross at some times. We provide a different analysis that can also 
show the performance of EXE on multiple machines, where the time is divided 
into periods in a different way according to the behavior of EXE. 

Let 5 be a set of job instances. Then EXF{S) (OPT{S)) represents the sum 
of the processing times of jobs scheduled by EXE (OPT, resp.) in S. Interchange- 
ably, for a time interval T, EXF{T) {OPT{T)) also represents the set of all jobs 
scheduled by EXE (OPT, resp.) in T. Also |5| is the number of jobs in S. 



1.1 Previous Work 

Off-line non-preemptive scheduling of jobs with deadlines is well-known in the 
literature from the early days [6, 11], but very recently, there have been active 
studies for this problem [1, 2]. where the jobs have arbitrary release times and 
weights and the types of machines are given as the single machine, identical 
machines, and unrelated machines. They give constant approximation algorithms 
using various optimization techniques, for example, LP-rounding, local search, 
and etc. 

In the on-line setting, it originates from the work of Lipton and Tomkins 
[10], where each job must be scheduled or rejected as soon as it arrives. For 
the general case in which the jobs have arbitrary processing times, they give 
0((log Z\)^+*^)-competitive algorithm and show that no 0(log Z\)-competitive al- 
gorithm can exist, where A is the ratio of largest to smallest processing time. In 
[7], they allowed the jobs to specify arbitrary slacks, i.e., n > 0, and the effect 
of the positive slack times was shown in [8]. For equal processing time jobs, he 
proves EXE is (1 -I- -|-^^j^)-competitive and shows that no on-line algorithm can 
have a better ratio, and for arbitrary processing time jobs, he gives a (2 -|- -)- 
competitive algorithm and shows that this is best possible for all deterministic 
on-line algorithms. But these results are all restricted to the model of a single 
machine. 

For on-line preemptive scheduling problem, there are several related studies 
[3, 4, 5]. In [4], the effect of slack times for preemptive scheduling on a single 
machine is investigated, and in [5], the performance of preemptive scheduling on 
multiple machines is characterized by the stretch factor which is the ratio of the 
response time to the processing time, that is, . 
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Fig. 1. The division of time intervals. 



2 Equal Processing Time Jobs 

2.1 Case [kJ > 1 

Let J be the set of all job instances. W.l.o.g., we assume each job has the 
processing time 1. First we consider the case [kJ > 1. According to the behavior 
of EXF, we divide the time into periods and classify the jobs in J . Here, observe 
the behavior of EXF. When a new job arrives, EXF puts it into a queue, and 
whenever there is an idle time on some machine, EXF selects the job with the 
earliest expiration time in the queue and schedules it on the idle machine. If a 
job has not been selected until its expiration time, it is rejected at that time. 

Inductively, we define the time periods Ti = i = 0, 1, • • •, and = 

oo. Assume that the time periods Tj,j = 0, • • • , i have been defined and there is 
a job which EXF rejects in (— oo, t^). Then we consider the job which is rejected 
in the last in (— oo,tJ) and denote it by A. Let ti be the time at which A is 
rejected and let Si be the release time of A. Then any job with expiration time 
> ti cannot be scheduled in [si, ti], because if such a job were scheduled, it would 
have an expiration time greater than A and EXF would schedule li. Thus if a 
job has an expiration time > ti, either it is scheduled after ti or before Si. We 
refer to the time ti as the critical time, and we will also define the time 
called the harrier time. Let t be the time at which an idle time last appears on 
some machine before s^. If there is no job with expiration time > ti which is 
scheduled in [i, Si), we set the time t^_^i to be t. Otherwise, let I be the job with 
expiration time > U which is last scheduled in [t, Si) and let t be the time when 
I is scheduled. Then at the time t, there is no job with expiration time < ti in 
the queue, because if such a job were in the queue, EXF would schedule it in 
place of I. Consider the job, denoted by I, which is first scheduled after t, and 
let i be the time at which I is scheduled. Then we can see that new jobs have 
been released in {t,t], because otherwise, if t > Si, then it is a contradiction to 
the fact that A was released at Si, and if t < Sj, then I were also in the queue 
at t, that is, / had the expiration time > ti, and it is a contradiction to the 
definition of I. Thus we set t\j^i to be the time at which a job is first released in 
{t,t\, and we call the half-open interval the principal period and denote 

it by See Figure 1. Then a principal period is given for some fc > 0, such 
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that either there is no job rejected in [0, or there is the job Ik last rejected 
in but there is neither an idle time on some machine nor a job with 

expiration time > tk in [0, Sk), where tk and Sk are the critical time determined 
by Ik and the release time of Ik, respectively. 

In the following, we will investigate the characteristics of the principal periods 
Ti, i = 0, - ■ ■ ,k. 

Lemma 1. In each principal period Ti, there is no idle time on any machine in 
EXF’s schedule between the barrier time t\ and the critical time ti, and each job 
scheduled in [ti,ti] should have an expiration time < ti. 

Proof. Fix a principal period Ti. Let li be the job which is rejected by EXF at 
ti and Si be the release time of L. From the definition of the barrier time t^, we 
see that no job with expiration time > ti is scheduled in Si) and no idle time 
exists on any machine in that interval. After is released at Si, no idle time 
appears on any machine, and EXF never schedules any job with expiration time 
> ti until li is rejected at U, because otherwise, EXF could have scheduled A. 

In each principal period Ti, we call jobs in EXF{{ti,t\_i)) {OPT{{ti,t\_i))) 
late jobs of EXF (OPT, resp.) and CEXpiTi) {h^OPT{Ti)) represents the set of 
all late jobs of EXF (OPT, resp.) in Ti. In the next lemma, it is shown that each 
late job of OPT should be scheduled by EXF and it is also a late job of EXF in 
some principal period. 

Lemma 2. In each principal period Ti, all late jobs of OPT in Ti must be sched- 
uled by EXF and in particular, they are late jobs of EXF in some Tj. 

Proof. Fix a principal period Ti and let J be a late job of OPT in Ti. Then it 
has the expiration time e > ti. Assume it was not scheduled by EXF before or 
at ti. Then either, at ti, J was in the queue of EXF or J is released after ti. If 
ti < e < t\_i, J is scheduled in {ti,t\_-^), because EXF never rejects any job in 
the queue within the interval. Otherwise, i.e., e G Tj, for some 0 < j < i — 1, 
then in the case the barrier time is determined by an idle time, J is scheduled 
before because if it were not scheduled until tJ, it would be alive in the queue 
at t^j and it would be scheduled. Suppose is determined by a job J with an 
expiration time > tj and J is not scheduled before If < e < tj, it is a 
contradiction, because J has the expiration time greater than J but J would 
be in the queue at the time when J was scheduled. So tj < e < t^j-i- From 
Lemma 1, J is not scheduled within the interval [t^tj] and it is in queue at 
tj. Since EXF never rejects any job in the queue within {tj,t’j_i), the job J is 
scheduled. 

If a job J with a processing time I starts to process at s and s <t < s-\-I, the 
job J is said to cross the time t. For each principal period Ti, we define frontier 
jobs of Ti in EXF (OPT) as the jobs scheduled by EXF (OPT, resp.) to cross 
the time ti — [ti — denoted by fi. In particular, E{T) represents the set of 
all frontier jobs of a principal period T in EXF. Here we define new periods {Tj} 
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from the principal periods {Ti}. In nondecreasing order, two adjacent principal 
periods may be merged into one satisfying the following criterion. Assume new 
periods Tk,k = 0, - ■ ■ ,j — 1, are defined from Tk,k = 0, - ■ ■ ,i — 1. Let gi^ be the 
time t^. + [tk — + 1. First set Tj to be Ti. If gi+i > fi, then set Tj to be 

Tj UTi+i, that is, two principal periods Ti and T^+i are merged, and otherwise, 
set Tj+i to be T^+i. Thus the new periods {Ti} are obtained, each of which 
consists of either one principal period or two principal periods. We will refer 
to the sole period as the period which consists of one principal period and the 
combined period as the period into which two principal periods are merged. 

In the following lemmas, we will bound \EXF{Ti)\ and \ OPT{Ti)\ for each i. 

Lemma 3. For each sole period Ti, 

\FXF{fi)\ > \T{Th,) n FXF{Ti)\ + m[th, - + |£bxf('T^J|, 

where Ti is the principal period Th^ ■ 

Proof. Fix a sole period Ti. From Lemma 1, there is no idle time on any machine 
in [t'f,.,thi] the schedule of EXF. So EXF can schedule at least m[thi — t'fj 
jobs which are not the frontier jobs of T^. within this interval. 

Lemma 4. For each sole period Ti, 

\OPT{fi)\<m{[th,-tlj+l) + \CopT{Th,)\, 
where Ti is the principal period Tf^ . 

Lemma 4 is obvious, since any scheduler can allocate at most J -1-1) 

jobs in [t^.jtfii], and Lemma 5 can be shown similarly to Lemma 3. 

Lemma 5. For each combined period Ti which consists of two principal periods 
^hi and 

\FXF{fi)\ > \T{Th,+i) n FXF{Ti)\ + m{[th, - 4j 

hi + 1 

Lemma 6. For each combined period Ti which consists of two principal periods 
Tfii and Th^^i, 

\OPT{W\ < milt,,, - tl\ + lt„,+i - tl+,\ + 1) + \£opt{ThJ\. 

Proof. Consider frontier jobs of T/j. in OPT. Then there are at most mlth, — 
jobs which are not the fronter jobs of T/j. and are scheduled by OPT in , th,]. 
Also since gh,+i > fh,, at most mlthi+i — jobs which are not the fronter 

jobs of T/j. can be scheduled by OPT in [t\.^i,t\,). Thus since there can be at 
most m frontier jobs of T^,, the upper bound is obtained. 
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Fig. 2. The schedule of EXF in the case that Tj is the sole period. 



From Lemma 2 ~ 6, we will show the performance of EXF when [kJ > 1. 

Theorem 1. The EXF algorithm is (1 + ) -competitive, where [kJ >1. 

Proof. From Lemma 2, all late jobs of OPT also are late jobs of EXF, that is, 
for each sole or combined period Ti,i = 0, ■ ■ ■ , K, 

i 3 

where the sole or combined period consists of the principal period T^. or the 
principal periods Th^ and respectively, and the right sum is done over all 

principal periods Tj. For convenience, assume that Tk is the combined period 
and there is neither an idle time on some machine nor a job with an expiration 
time > thK+i scheduled in Then \T{Thi^+i) H EXF{Tk)\ = m. Thus 

we obtain the following from Equation (1) and Lemma 3 ~ 6: 

K-l 

OPT{J) < EXF{J) +Km~Y, n EXF{fi)\, 

i=0 

where we define ki = hi and ki = hi + \ for each sole period and combined period 
Tj, respectively. Also the lower bound of EXF{J) is given by the following claim. 

Claim. EXF{J) > Km\_n\ + Km - \J^(TkJ n EXF{f,)\. 

Let A^ = HTk,) - EXF{fi) and B, = EXF{[tl^, fn,]) C Cexf{Th,)- Then A. 
consists of the jobs which are frontier jobs of T^. and not scheduled within Tk^, 
and if Ti is a combined period, Bi contains all jobs in EXF{Tkf), all frontier 
jobs of Thi, and all late jobs of EXF in and if Ti is a sole period, it contains 
the frontier jobs of T^. belonging to EXF{Thf) and all late jobs of EXF in Tf^. 
First we can see from Lemma 1 that for each i = 0, • • • , AT — 1, \EXF{Ti) —Bi\> 

. Let A = and B = U^gBi. We define a mapping I : A — B 

as follows. Given J G A. Then J G T{Tk^) — EXF{Tj), for some j. If J is the late 
job of EXF in Th^^^, that is, J G EXF ({thj^^,t\^)) ^ then set I{J) = J, and if 

J G EXF{[Cj^,^^ , then in the case that Tj is the combined period, there is a 
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job J' G EXF{[tl. _,tkj)) on the same machine as J, because ~tk \ — — 1- 

For convenience, we choose the job J' scheduled next to J and set X{J) = J' ■ In 
the case that Tj is the sole period, there is a frontier job J' G , fhj+i]) 

on the same machine as J, because otherwise, if the last job in EXF{[t \,^^ , 

on the machine is completed a,t t,t < gh^+i < fk, since Tj is the sole period, that 
is, J cannot be the frontier job of Tk^ and it is a contradiction (see Figure 2). 
Then we set T{J) = J' ■ It is obvious that the mapping X is one to one, which 
says \A\ < \B\. Let Si = E(TkJ n EXF{f,). Then since |^| = Km- \S,\, 

we can prove the claim from the following: 

K-l 

EXF{J) > Y, \EXF{f,) -B,\ + \B\ 

i^O 

K-l 

>mY - 4j + l-4| 

i^O 

K-l 

> Km[n\ + Km — ^ \Si\. 

i=0 



Consequently, we obtain the desired bound: 

OPTjJ) ^ Km-Y.^\S^\ ^ 1 

EXF{J) ~ Km[K\ + Km — \Bi\ ~ + 1 



2.2 Case [kJ = 0 

In this subsection, when [kJ = 0, it is shown that the EXF algorithm is 2- 
competitive. We obtain the set of the principal periods {Ti\ as in the previous 
md merge them differently. New periods Ti are obtained as follows: Assume 
Tfc, fc = 0, • • • , j — 1, are obtained from T^, fc = 0, • • • , t — 1. If there is a job which 
starts to process in ^md is completed after ti, for some I > i, then let 

be the principal period having the largest index u > i in which such a job exists, 
and we merge the principal periods Ti, - ■ ■ ,Tu into one as Tj . Otherwise, set Tj 
to be Ti. Thus the periods Ti which consist of one principal period and principal 
periods more than one are called long periods and short periods, respectively. Let 
and f7i be the set of all jobs which is released within Ti and Ti, respectively. 
Then we compare EXF{Ji) with OPT{Ji) for each i. In each Ji, the jobs having 
expiration times > ti are called patient jobs and it can be shown similarly to 
Lemma 2 that all patient jobs should be scheduled by EXF. Also we note that 
in each Ti, all jobs scheduled by EXF in [ti,ti] belong to Ji. 

Lemma 7. In each principal period Ti, all jobs scheduled by EXF in [ti,ti] be- 
long to Ji, that is, they are released after or at the barrier time t\ ofTi, and all 
patient jobs of Ji must be scheduled by EXF. 
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Lemma 8. For each long period Ti, 

OPT{Ji) < EXF{Ji) + m. 



Lemma 9. For each short period Ti, 

OPT{Ji) < EXF{Ji) + m. 

Proof. Fix a short period Ti which consists of principal periods Tj, ■ ■ ■ ,Tk- If 
9k = + Ufc ~ + 1 ^ tjj then since Ti is the short period, we see that 

tj — tfc < 1. Thus OPT can schedule at most m{\tk — t\.\ + 2) jobs in [t\, tj\. So 
the upper bound of OPT{Ji) is given by 

OPT{Jf)<m{\tk-tl\+2) + \Vj\, 

where Vj is the set of all patient jobs of Jj. But EXF schedules at least m\tk—t\\ 
jobs which are not the frontier jobs of T^ in Also the m frontier jobs of 

Tj exist, and if some of them start to process before or at tk, there is the frontier 
job of Tk on the same machine, because gk < tj. Thus we can obtain the m jobs 
of fli each of which is either a frontier job of Tj scheduled after tk or a frontier 
job of Tk- Therefore the following is given from Lemma 7: 

EXF{J,)>m{[tk-tl\+l) + \Vj\. 

If gk > tj, then OPT can schedule at most m{[tk — t^\ + 1) jobs in [tk,tj] but 
EXF schedules at least m\tk — jobs in Thus the desired equation is 

obtained. 

Theorem 2. The EXF algorithm is 2- competitive, where [kJ = 0. 

Proof. Given the long or short periods Ti,i = 0, • ■ ■ , K — 1. Actually it remains 
to prove '^i EXF(fli) > Km. Fix Tj and we assume that Tj consists of the 
principal periods Tj, - ■ ■ , Tj+k, where fc > 0, and fc = 0 if is the long period. 
Then from the definition of the critical time tj we can see that there are exactly 
m jobs crossing tj. Thus we assign the m jobs to Ti. If Ti is the long period, 
any job scheduled before or at tj+i cannot be completed after tj, and if Ti is the 
short period, any job scheduled before or at tj+k+i cannot be completed after 
tj, because if there is such a job, it is a contradiction to the maximality of Tj+k 
in the definition of Ti. Thus we can see that the m jobs assigned to Ti cannot be 
re-assigned by any other periods and the desired fact is proved. Consequently, 
it is followed from Lemma 8 and 9 that 

OPT{J) < EXF{J) + Km< 2EXF{J). 

Until now, we have proved that EXF is (1 -I- )-competitive, for k > 0. 
Next, it is shown that the competitive ratio is tight by giving the lower bound. 
The instance of the lower bound is given just by making m copies for each job 
in the instance given in [8]. 
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Lemma 10. For k >0, 



max 

J 



OPTjJ) ^ 1 

EXF{J) - L'«J + 1 



and, for any online algorithm A, 



max 

J 



OPTjJ) ^ 1 

AjJ) - 2[Kj-h3' 



3 Arbitrary Processing Time Jobs 

As in the last section, the principal periods are defined, and the barrier times 
and the critical times ti which determine Ti are given. Let Ji be the set of all 
jobs which are released within Ti . It can be shown similarly to Lemma 7 that all 
patient jobs of Ji having expiration times > F should be scheduled by EXF. 

Lemma 11. In each principal period Ti, there is no idle time on any machine 
between the harrier time t\ and the critical time U, and all patient jobs of Ji 
must be scheduled by EXF. Thus we get 

EXFjJ) > max{^ m[ti - t \\ , ^ ^ Pj}, 

i i j^Vi 



where Vi is the set of all patient jobs of Ji. 



Lemma 12. For each principal period Ti, 

OPTjJi) < (1 -h ^)mjU - t\\ + ^ Pj, 

jeVi 

where Vi is the set of all patient jobs of Ji. 

Proof. Let J be a job of Ji scheduled to cross the time ti by OPT. In fact, 
there can be at most m jobs crossing ti. If the job J is not the patient job of 
Ji, its slack time is no more than [t, — t^\. From the slackness condition, the 
processing time f of J is less than or equal to the slack time divided by n, that 
is, i < \ ■ Futhermore, the total gain of all the jobs in Ji which can be 

scheduled to be completed before ti is at most m \ti — t\\ . Thus we obtain the 
bound. 

Now, we prove that EXF is (2 -|- i)-competitive, for k > 0, and we can also 
obtain the instance of the lower bound similarly to in [8]. 

Theorem 3. The EXF algorithm is (2 -|- -competitive, where k > 0. 
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Proof. From Lemma 11 and 12, it is shown that 

OPT{J) < ^(1 + hm[U - + E E 

< (1 + -)EXF{J) + EXF{J) = (2 + -)EXF{J). 
K n 



Lemma 13. For n> 



max 

J 



OPTjJ) ^ . 1 

EXF{J) - K 
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Abstract. Motivated by an application in thinwire visualization, we 
study an abstract on-line scheduling problem. Unlike most scheduling 
problems, our schedulers can gain partial merit from a partially served 
request. Thus our problem embodies a notion of “Level of Service” that 
is increasingly important in multimedia applications. We give two sched- 
ulers FirstFit and EndFit based on two simple heuristics, and generalize 
them into a class of greedy schedulers. We show that both FirstFit and 
EndFit are 2-competitive, and any greedy scheduler is 3-competitive. 
These bounds are shown to be tight. 



1 Introduction 

We study an abstract on-line scheduling problem motivated by visualization 
across a “thinwire” network [4, 3]. An example of such a visualization problem 
is a server-client model where the server and client are connected by a thinwire 
(that is, a bandwidth-limited connection such as the Internet), with the server 
holding a very large image that the client wishes to visualize. The viewer on the 
client side can control the transmission process by moving a mouse cursor over a 
low-resolution copy of the image to be visualized. This mouse motion generates, 
in real-time, a sequence of sampled positions along the mouse cursor trajectory. 
Each sampled position (x, y) corresponds to a request for higher resolution data 
at the position. As the bandwidth is limited, we could only partially serve each 
request. This is where an on-line scheduler is needed to optimize the decisions. 
In most scheduling problems, a partially served request does not contribute to 
the performance of the scheduler. However, in this problem, a partially sent data 
can still provide useful information to the user. Thus, instead of sending all the 
requested data, the server has the option of lowering the “level” of the requested 
service in order to gain an overall better response time. This paper focuses on 
this level of service property. Note that there is considerable interest in similar 
Quality of Service (QoS) issues in multimedia research. 

* This research was partially funded by NSF grant CCR-9619846. 
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We use the standard notion of “competitiveness” in the sense of Sleator and 
Tarjan [8]. to judge the quality of our online schedulers. A scheduler S produces 
a feasible schedule S{I) for each instance I of our scheduling problem. Each S{I) 
has an associated merit, where merit{S{I)) > 0. Let opt(/) denote any feasible 
schedule for I that maximizes the merit. We say S is c- competitive (c > 1) if for 
all I, 

merit{opt{I)) < c ■ merit{S{I)) + b, 
where 6 is a fixed constant. The competitive ratio of S is defined by 



C{S) := sup 
/ 



merit{opt{I)) 

merit{S{I)) 



Thus, we want schedulers S with C{S) > 1 as small as possible. There is a fairly 
large literature on competitive algorithms (e.g., [1, 2, 7]. The class of problems 
most closely related to ours is the online interval packing problem for a single 
server, where a schedule is a subset of non-overlapping intervals. Lipton and 
Tomkins [6] study a variant where the input intervals are sorted by their left 
endpoints. They give a randomized scheduler that is 2-competitive. As we will 
see, our problem is different from theirs in several ways. Woeginger [9] studied 
a problem that has several of the features of our problem. Other online interval 
packing problems can be found in [10, 5]. 



2 Problem Formulation 

We formalize our problem as an on-line scheduling problem. Each request q has 
four parameters 

q = {s,t,v,w), 

where s the start time, t the termination time (or deadline), v is the volume (or 
size), and w is the weight. We require 

V > 0 and w >0. 

Write st{q), dl{q), sz(q), wt(q) for the above parameters of q, respectively. Call 
the half-open interval (s,t] the span of q, written span{q). A request q can only 
be served within its span (s, t], and at any time moment %, at most one request 
can be served. 

An instance / is a sequence {qi,q 2 . . . qn) of requests where the start times of 
the qiS are in increasing order: note that we allow st{qi) = st(gi_|_i) even though 
we nominally say qi starts before qi+i- How requests are served is described by 
the schedule. Formally, a schedule for / is a piece-wise constant function 

iJ : {gi,g2,---,gn}U{0}, 

where \H~^{qf^)\ < sz(< 7 fc) and H~^{qk) Q span{qk) for k = 1, . . . ,n. Intuitively, 
H{to) = qk means the kth request is served at time to and H{to) = 0 means 
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no request is being served. A time moment to is called a breakpoint if H is 
discontinuous at to- (More precisely, for every £ > 0, there exists Si {0 < Si < e, 
i = 1,2) such that H{to) = H{to — 5i) yf + S 2 ))- We further require a 
schedule H to has finitely many breakpoints. In addition, for each request q, 
\H~^{q)\ < sz{q). A half-open interval of the form (to,ti] is called a time-slot. 
Without loss of generality, we may assume H~^{q) is a finite union of time slots. 
The merit merit{H) of a schedule is 

n 

Relative to a schedule H at any time to, we call 

v' := sz{q) - \H~'^{q) n (-oo,to]| 

the residual size of q. A request q is completely served if v' = 0. If v' > 0 and 
st(( 7 ) < to < dl(g), then we say q is pending. The residue of a pending q at time 
to is the modified request q' = {to,dl{q),v' ,'mt,{q)). 

For each completely served request, the scheduler gains sz{q)vit,{q) merit 
points (so weights are multiplicative). Moreover, partially served requests gains 
a proportional fraction of this merit. This is unlike usual scheduling problems 
in which partially served requests receive no merit. Our model is meaningful 
for the “foveated visualization” studied in [4, 3] because a scheduler can reduce 
the amount of requested visualization data along a finely graduated scale. This 
can be achieved by reducing two parameters, the foveal radius and/or foveal 
resolution. The foveal radius measures how fast the resolution falls away from 
the foveal center, and the foveal resolution measures the maximum resolution 
(which is at the foveal center). 

Preemption. It is implicit in the above definitions that the servicing of any 
request can be preempted as often as we like with no penalty. Hence we may 
imagine the scheduler to “plan” a schedule based on all the currently residual 
requests. It services the requests according to this plan until the arrival of a new 
request. Then it suspends the current plan, recomputes a new plan based on the 
new set of residual requests, and repeats the process. 

Optimal Schedules. We say H is optimal for I if merit{H) is maximum among 
all schedules for I. The existence of optimal schedules is not immediate. 

Lemma 1. For all sequences I of n requests, there exists an optimal schedule 
with at most 2n^ n breakpoints. 

The optimal schedules for an instance may not be unique. In the full paper, 
we give a canonical representation whereby the optimal schedule is unique. Let 
opt(/) denote the canonical optimal schedule for I. 

Ordering of Requests. The schedulers in this paper make decisions by giving 
priority to heavier weighted requests. In case wt{p) = wt{q), we resolve the tie 
by treating p as “heavier” than q if and only if p starts before q. 
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2.1 FirstFit 

The online scheduler that always serves the heaviest residual request at each 
moment is called the FirstFit scheduler. Figure 1 shows the schedule produced 
by FirstFit on an instance of two requests q\ and q2- Although this example 
may appear contrived, we can modify qi to q\ where q\ = (0, 2, 1, 1 + e). For any 
e > 0, the FirstFit schedule is the one shown in Figure 1. 



I 


gi 1 






Q2 1 






o' 1 


2 ' 



FirstFit(/) [ 
opt(/) [ 



Fig. 1. The top figure illustrates the instance I = (gi, <72) where <71 = (0, 2, 1, 1) 
and (72 = (0, 1,1,1). Each horizontal “dashed” line represents the span of the 
request. Although wt((7i) = wt((72), q\ is “heavier” than (72 by our tie-breaking 
rule. In opt(/), qi and (72 are served in the time-slots (1, 2] and (0, 1], respectively. 
However, in FirstFit(I), only <71 is served. 



2.2 EndFit 

Consider an online scheduler which always serves according to the optimal sched- 
ule for the current set of residual requests. This was first suggested by Estie 
Arkin^. To implement such a scheduler, we can invoke a general off-line algo- 
rithm for computing optimal schedules upon each new arrival of a request. But 
it turns out that a very simple scheduler can be used. This scheduler, on an 
arbitrary instance I, operates as follows: 

Starting from the heaviest request down to the lightest, allocates each request 
q £ I to the latest available time-slot(s) within span{q). 

Call this the Off EndFit scheduler. It is an off-line algorithm because it must see 
the entire set of requests to make its decisions. This scheduler is optimal for a 
special class of instances. 

Lemma 2. If I is an instance in which all requests have a common starting 
time, then DffEndFit(/) is the canonical optimal schedule for I. 

Let EndFit be the online scheduler which always serves according to the 
Off EndFit schedule for the residual requests. More precisely, on arrival of a new 



1 



Private communication (1997). 
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request q, EndFit preempts the current service. It computes a new schedule P 
for the current residual requests using DffEndFit, and continues by servicing P. 
Call P the plan upon arrival of q. Since all residual requests have a common 
starting time, P is the canonical optimal schedule. Figure 2 shows the EndFit 
schedule for an instance / = {qi, ( 72 )- The EndFit schedule for this example may 
appear contrived. To see that it is “correct in the limit”, let 1^ = (qoiqu<l 2 ) 
where qo = (0, 1, l,e). For any 0 < e < 1, the off-line optimal schedule for the 
current residual requests is unique. As e — >■ 0, EndFit(/<;) approaches the one 
schedule shown in Figure 2. 




Fig. 2. The top figure illustrates the instance I of two requests: <71 = (0, 2, 1, 1) 
and <72 = (1,2, 1, 1). In the opt(/), <71 and <72 are served. However, in EndFit(I), 
only < 7 i is served. 



3 Competitive Ratio of FirstFit 

Example 1 (Figure 1) shows that the competitive ratio of FirstFit is at least 
2. We will show that FirstFit is 2-competitive. Before presenting the proof, let 
us give two definitions. 

Charging Scheme. Let H, Hi and H 2 be schedules for an instance I. We often 
need to argue that the merit of H is no larger than the sum of the merits of 
Hi and H 2 - Our approach is to charge a portion of H to Hi and the remaining 
to H 2 - Intuitively, the charging process can be viewed as first cutting Hi and 
H 2 into pieces and then piecing them together again to form another piecewise- 
constant function iLchg- Each piece, after cutting, may be translated before being 
placed into their slot in iLchg- As it may turn out that \H^{q)\ > sz{q) for some 
request q, H^hg is not necessarily a schedule. The cut-and-paste is done in a way 
that for all t, wt(iJchg(t)) > wt(iL(t)). Therefore, 

merit{H) < merit{H chg) < merit{Hi) + merit{H 2 ) ■ (1) 

In particular, if Hi = H 2 , then we have merit{H) < 2 • merit{Hi). When we use 
the phrase: “charge {s',e'\ from H to Hi at (s,e]”, we mean that a piece (s, e] 
is cut from Hi and placed into the slot (s',e'] in iLchg- Equivalently, if H~^{q) 
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is the interval (s', e'], we may say that the request q is charged from H to Hi at 
(s, e] (see Figure 3). Thus, to show (1), we need to charge each time slot of H to 
either H\ or H 2 , and ensure that no part of H\ or H 2 is charged more than one. 




Fig. 3. Charging the request q in i/ to Hi at (s, e]. 



Intactness. A request q is intact in a schedule H if H~^{q) is connected, and 
either \H~^{q)\ = 0 or sz{q). Most of our proofs will be simplified if we assume 
intact ness. 

Theorem 1. For any instance I, 

merit(opt(/)) < 2 • merit(FirstFit(/)). 

Proof. Given an instance I, let Hff := FirstFit(I) and i?opt := opt(/). We can 
assume that requests in I are intact in both H±f and i?opt- 

Let Ho be an identical copy of Ha. We want to charge requests served in 
iLopt to Hq and H^. Let {ti,t 2 , . . . ,tm} be the distinct breakpoints in iLopt> 
where U < tj if and only it i < j. 

For each ti, starting from i:=l to m — 1, consider the time-slot (ti,C+i]. 
Let 9opt ■= HoptiU+i). Let qtf be the lightest request served during (fi,tt+i] by 
FirstFit. There are two cases: 

1. If the request q±f is not lighter than qopt, charge <7opt from iLopt to Ha at 
{ti , C+l] . 

2. Otherwise, charge gopt from iLopt to Hq at iL(C^(gopt)- 

We have to show that in the second case, |iL(C^(gopt)| > |(C, ti-i-i]|- In the first 
place, why is the weight of qa lighter? The request q±± is chosen by FirstFit 
because it is the heaviest request among the pending requests. This implies that 
gopt is not a pending request, even though U is in the span of gopt. So gopt 
must have been completely served by FirstFit. This implies that |^?o"^(9opt)| > 
|(ti, tj+l] |. 



4 Competitive Ratio of EndFit 

Example 2 (Figure 2) shows that the competitive ratio of EndFit is at least 2. 
We now show that this constant is the best possible. 
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The upper bound proof is considerably more subtle than the proof for 
FirstFit. The key result is Theorem 2 below which formalizes this observa- 
tion about EndFit: it never hurts the performance of EndFit to have a request 
started at an earlier time. For example, in Figure 2, the performance of EndFit 
will improve if the request <72 starts at an earlier time. The analogous fails for 
FirstFit. For example, in Figure 1, the performance of FirstFit would improve 
if qi starts at a time later than 0. 

A request g is a trimmed version of q if st(q) > st{q), dl(g) = dl{q), wt(g) = 
wt(q) and sz(q) < sz(< 7 ). Thus, a trimmed version of q may start later than the 
original q. An instance / is a trimmed instance of I if there is a one-one (not 
necessarily onto) mapping from I to I such that any g in J is a trimmed version 
of its corresponding request in I. Clearly, merzt(opt(/)) < merzt(opt(/)). Similar 
relationship also holds for EndFit. 

Theorem 2. If I is a trimmed version of I then 

merit(EndFit(/)) < merit(EndFit(/)). 

We now use Theorem 2 to show that EndFit is 2-competitive. 

Theorem 3. For any instance I, 

merit(opt(/)) < 2 • merit (EndFit (/)). 

Proof. We can assume that requests in I are intact in opt(/). Let I be the 
trimmed instance of I such that for any request g G I, if {opt{I))~^{q) = (^ 1 ,^ 2 ], 
then the corresponding trimmed request g satisfies st(g) := ti and sz{q) := t 2 —ti] 
otherwise if opt(/)“^(( 7 ) = 0, then q satisfies sz{q) :=0. We can further assume 
that I is intact in all the plans of EndFit with I. (Recall that a plan is the 
optimal schedule for the residues). By definition, we have 

merzt(opt(/)) = merzt(opt(/)). (2) 

The instance I has the nice property that requests arrive at a “constant rate”, 
that is, if a request q starts at time t, then no other request starts during (t, t + 
sz{q)). Let Hi and H 2 be two identical copies of EndFit (/). Our theorem is 
proved if we show how to charge requests in opt(/) to Hi and H 2 - 

Consider a request q in I. Let P'^ be the plan upon the arrival of q and P~ 
be the plan just before the arrival of q. There are two cases. 

1. If g is allocated in the new plan P+, then it is possible that there are some 
requests which are originally allocated in P~ , but not in the new plan. Call 
these requests the ousted requests. 

2. Otherwise, call q the ousted request. 

Let s be the total size of the ousted requests. Note that s < sz{q) and the total 
merit of the ousted requests is not more than the total merit of the requests 
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allocated in (st(q), st(< 7 ) + s] in P+. Furthermore, the new plan will be carried 
out without interruption at least until st{q) + sz{q). Charge the ousted requests 
to H 2 at (st(< 7 ), st(< 7 ) + sz(q)] and the served requests during (st(q), st(g) + 
sz(q)] to Hi at {st{q),st{q) + sz(g)]. The above is a valid charging scheme. 
Thus, we have 

2 • ment(EndFit(/)) > merzi(opt(/)). 

By Theorem 2 and (2), we have 

2 • ment(EndFit(/)) > merzi(opt(/)). 



5 A Class of Greedy Schedulers 

Looking at the behavior of EndFit and FirstFit on specific examples, it appears 
that they are complementary in the sense that if EndF it performs poorly on an 
instance, then FirstFit will perform well, and vice-versa. This suggests studying 
some combination of these two heuristics and motivates the generalization to a 
class of Greedy schedulers. 

A scheduler S in this class behaves as follows. 

(A) At the moment a new request q starts, it suspends the current service (that 
is, it preempts the currently served request). 

(B) Scheduler S computes a new plan H, which is a schedule for the set of 
residues of currently pending requests. We call H a ‘plan’ because the sched- 
uler may not carry out the schedule as planned due to the subsequent new 
requests. The plan H is computed by considering the residues one by one, 
starting from the heaviest request down to the lightest request. Let p be the 
request being considered and call \H~^{p) \ the allocation top. The allocation 
to p is subjected to the following restriction: 

(*) The allocation to p must be maximized. For example, if it is possible to 
completely allocate p, the whole of p must be allocated. However, there 
is no restriction on where p is allocated. Time-slots, once allocated, are 
not subsequently revised in creating this plan. 

(C) It carries out the plan until a new request starts, whereupon we go back to 
step (A). 

Different members of the Greedy class differ only in their strategies for (B) 
subjected to the restriction(*). Note that our first example FirstFit is a Greedy 
scheduler: the request p in (*) is allocated in the earliest possible time-slots. The 
second example EndFit is also a greedy scheduler. Its strategy for (B) is rather 
counter-intuitive: the request p is allocated in the latest possible time slots. 

By combining the counter examples for FirstFit and EndFit, we can find 
a Greedy scheduler whose competitive ratio > 3. The next theorem shows that 
this bound of 3 cannot be improved. 

Theorem 4. Every Greedy scheduler is 3- competitive. 
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6 General Lower Bound 

From the previous section, we know that every greedy scheduler is 3-competitive. 
Are there schedulers outside the Greedy class with competitive ratio less than 
2? We note a partial result in this direction: every deterministic scheduler has 
competitive ratio at least 2(2 — -\/2) > 1-17. 

In proof, consider this adversary: at time 0, the adversary releases two re- 
quests qo :=(0, 2, 1, 1) and qi :=(0, 1, 1, -\/2 — 1). At time t = 1, let the residual 
size of qo be sq- If so is less than 1/2 then the request (72 :=(1, 2, 1, 1) is re- 
leased. Otherwise, no further requests will be released. It may be verified that 
any deterministic scheduler achieve a merit of at most 2(2 ^^2) maximum 

possible. 

Unfortunately, this lower bound of 1.17 leaves a wide gap from the current 
upper bound of 2. On the other hand, no simple variation of this adversary seems 
to give a better lower bound. 

7 On the Number of Breakpoints and Multi-tasking 

Both FirstFit and EndFit make 0{n) breakpoints where n is the number of 
requests. Does the number of breakpoints affect the performance of a scheduler? 
We give an alternative formulation of the scheduling problem which could be 
viewed as allowing an infinite number breakpoints. In this multi-tasking envi- 
ronment, several requests can be served concurrently, but each at a (possibly) 
different rate. However, in any time interval of size At, the total size of requests 
served within this interval must not exceed At. Equivalently, we allow “fractional 
service” where the total service at any moment sums to 1. 

A concrete example of such a scheduler is FirstEndFit: It simulates FirstFit 
and EndFit concurrently and serves half of what FirstFit and EndFit would 
serve. That is, if FirstFit and EndFitwill serve q and p respectively in the time 
slot (sqUo]) then FirstEndFit will serve p and q concurrently but each at half 
the rate. We suspect that FirstEndFit is (3/2)-competitive. 

Note that FirstEndFit can be also viewed as the following randomized sched- 
uler under the original single-tasking setting: Before receiving any request, it 
tosses a fair coin; If the outcome is head, then it simulates FirstFit, otherwise 
it simulates EndFit. 

Clearly, the expected merit gained by this randomized scheduler is same as 
the merit gained by FirstEndFit. 

8 Conclusion 

We have formulated a “level-of-service” scheduling problem that arises naturally 
in our thinwire visualization applications. This formulation is also useful in real- 
time systems where quality of jobs can be traded-off for time. We have derived 
several competitive algorithms in this setting. We continue to investigate the 
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many interesting questions that are open. Besides sharpening the results in the 
paper, we pose the following directions for further work: (1) Find optimal sched- 
ulers which are not restricted to be on-line. (2) Study the problem with other 
measures of merit (instead of multiplicative weights in this paper). (3) Introduce 
a model of penalty for preemption. (4) Introduce randomization. 
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Abstract. We consider one-dimensional and multi-dimensional vector 
covering with variable sized bins. 

In the one-dimensional case, we consider variable sized bin covering with 
bounded item sizes. For every finite set of bins B, and upper bound 
1/m on the size of items for some integer m, we define a ratio r(B,m). 
We prove this is the best possible competitive ratio for the set of bins 
B and the parameter m, by giving both an algorithm with competitive 
ratio r{B,m), and an upper bound of r{B,m) on the competitive ratio 
of any on-line deterministic or randomized algorithm. The ratio satisfies 
r{B, m) > m/{m -\- 1), and equals this number if all bins are of size 1. 
For multi-dimensional vector covering we consider the case where each 
bin is a binary d-dimensional vector. It is known that if B contains 
a single bin which is all 1, then the best competitive ratio is 0(l/d). 
We show an upper bound of 1/2'^*'^“°^'^^^ for the general problem, and 
consider four special case variants. We show an algorithm with optimal 
competitive ratio 1/2 for the model where each bin in 13 is a unit vector. 
We consider the model where B consists of all unit prefix vectors. Each 
bin has i leftmost components of 1, and all other components are 0. 
We show that this model is harder than the case of unit vector bins, 
by giving an upper bound of 0(1/ log d) on the competitive ratio of 
any deterministic or randomized algorithm. Next, we discuss the model 
where B contains all binary vectors. We show this model is easier than 
the model of one bin type which is all 1, by giving an algorithm of ratio 
I?(l/ log d). 

The most interesting multi-dimensional case is d = 2. Previous results 
give a 0.25-competitive algorithm for B = {(1, 1)}, and an upper bound 
of 0.4 on the competitive ratio of any algorithm. In this paper we consider 
all other models for d = 2. For unit vectors, we give an optimal algorithm 
with competitive ratio 1/2. For unit prefix vectors we give an upper 
bound of 4/9 on the competitive ratio of any deterministic or randomized 
algorithm. For the model where B consists of all binary vectors, we design 
an algorithm with ratio larger than 0.4. These results show that all above 
relations between models hold for d = 2 as well. 



1 Introduction 

We consider on-line problems of covering variable sized bins. The one-dimensional 
version of this problem is defined as follows: We are given a finite set B of al- 
lowed bin sizes, each b € B satisfies 0 < & < 1 and the largest element of i? is 1. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 463-472, 2001. 
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Items with size in (0, 1] arrive on-line, each item is to be assigned to a bin upon 
arrival. The algorithm may assign a new item to a previously used bin, or open 
a new bin of any size in i?. A bin of size b is covered if the total size of items 
assigned to it is at least b. The goal of an algorithm is to maximize the sum of 
the sizes of all covered bins. This sum is the value of an algorithm. In this paper 
we consider bin covering with bounded item sizes. The size of each arriving item 
is at most I/m where m is a fixed integer. 

The multi-dimensional version is defined as follows: For a dimension d we are 
given a set of d-dimensional bins B, which is a subset of {0, l}"^, i.e., all com- 
ponents of the allowed bins are binary. The “vector weight” of a d-dimensional 
vector b is the sum of its components, this applies both to multi-dimensional 
bins (“bin weights”) and to multi-dimensional items (“item weights”). In the 
one-dimensional case the “bin weight” of a bin is simply its size and the “item 
weight” of an item is the size of the item. Vector items with size in [0, arrive 
on-line, and as in the one-dimensional case, each new item is to be assigned to a 
new or previously used bin upon arrival. A bin of size b is covered if the vector 
X, which is the sum of all items assigned to the bin, satisfies x > b. The goal 
of an algorithm is to maximize the sum of the bin weights of all covered bins. 
This number is the value of the algorithm. We study the general problem of 
arbitrary sets B, and also consider three interesting cases of multi-dimensional 
variable sized vector covering. The first case is where B consists of all possible 
binary vectors. The second case is when all bins are unit vectors (have only one 
non-zero coordinate) . In the last case we consider a unit prefix sequence of bins, 
each bin has i leftmost 1 coordinates, and all others are 0. 

Applications: Variable sized bin and vector covering relate to a case where a 
large number of jobs of a finite type are to be done (these are the bins). Each 
job is done by one worker or by a combined work of more than one worker (the 
workers are the items). An item represents the amounts of work a worker can 
do in a day. Each worker is assigned to one job (bin). The benefit of a job is 
the minimum amount of work to do it, which is proportional to its size. This 
scenario relates to one dimensional bin covering. 

The scenario of vector covering is equivalent to the case where a job has d 
different qualities. We can normalize and assume that each quality which exists 
in the job has size 1, each type of job has to have a subset of the qualities, the 
different types of jobs are again represented by bins, which are d-dimensional 
binary vectors. The work of each worker is also represented by a d-dimensional 
vector which corresponds to the parts of each quality that the worker can do in 
one day. 

Definition of competitive ratio: We measure the performance of an on-line 
algorithm by the competitive ratio. Denote the value of the on-line algorithm 
by Von, and the value of an optimal off-line algorithm, that knows the sequence 
in advance, by Vopt- The competitive ratio is the supremum r (r < 1) for which 
Von > f • Vopt — ij is satisfied for every sequence (rj is an additive constant 
which does not depend on the input). For randomized algorithms we replace Von 
by E(Von) and in this case the competitive ratio is the supremum r for which 
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E{Von) > rVopt — rj is satisfied for every sequence. An on-line algorithm is called 
optimal, if it has a competitive ratio which is the best possible competitive ratio. 
Note that in variable sized covering, the competitive ratio can be constant, or a 
function of B. 

Known results: The one- dimensional version of the vector covering problem 
(for the case that all bins are of size 1) was first investigated in the thesis [2] of 
Assmann and in the journal article by Assmann, Johnson, Kleitman and Leung 
[3] . There it is proved that the greedy algorithm (that simply keeps putting items 
into the same bin until it is covered and then moves on to the next bin) has a 
worst case guarantee of 1/2. Moreover, two more sophisticated algorithms were 
derived with worst case ratios | and |, respectively. Both of these sophisticated 
algorithms are based on presorting the items, and consequently are off-line al- 
gorithms. The greedy algorithm, however, is an on-line algorithm. Csirik and 
Totik [8] proved that in fact the greedy algorithm is a best possible on-line al- 
gorithm, since no on-line algorithm can have a worst case ratio that is strictly 
better than 1/2. Csirik, Frenk, Galambos and Rinnooy Kan [7] gave a proba- 
bilistic analysis of the one-dimensional bin covering and of the two-dimensional 
vector covering problem (for the case that all bins are of size (1,1)). Gaizer 
[10] constructed an off-line approximation algorithm with worst case guaran- 
tee 1/2 for dimension d = 2. The article by Csirik and Frenk [6] summarizes 
all results on vector covering problems that were derived till 1990. The multi- 
dimensional vector covering problem was studied by Alon, Azar, Csirik, Epstein, 
Sevastianov, Vestjens and Woeginger in [1]. The paper considers the case where 
the only allowed bin is the “all 1” vector. This paper considers both on-line 
and off-line vector covering. [1] gives an on-line algorithm with competitive ratio 
l/{2d) and an upper bound of l/(d + 0.5) on the ratio of any deterministic or 
randomized algorithm. Finally Woeginger and Zhang [11] studied variable sized 
one-dimensional vector covering. They define a ratio r(B) which is the minimum 
between the minimum ratio of two consecutive bin sizes (ignoring all bins smaller 
than 1/2) and the inverse of twice the size of the minimum sized bin among the 
bins which are larger or equal to 1/2. Clearly, this number is smaller than 1/2. 
They show this ratio r{B) is optimal for a finite set of bins B. A survey on 
on-line packing and covering is given in [9] . 

Our results: We give some definitions in order to define the number r{B,m). 
Let B = {bi, . . . , 6g} be the set of allowed bin sizes, where 1 = 6i > 62 > . . . > 6s, 
and let m be an integer m > 1 such that 1/m is an upper bound on item sizes. For 
each 1 < i < s, let bij = bi/j and let Bi{m) be the set of fractions of bi between 
sizes l/(2m) and 1/m, that is Bi{m) = {bij\l < j < 2m} fl [l/(2m), 1/m]. 
We define a new set of bins by C(m) = Ui<i<sBi{m). Enumerate the sizes of 
numbers in C{m), C{m) = {ci,...,Cfe| where 1/m = ci > C2 > . . . > Cfe = 
l/(2m). For every element in C{m), recall an original bin size that caused it to 
be inserted into C{m). For Ci, let 6(cj) be the smallest bj in a way that there exists 
an integer y that satisfies yci = bj. Now define q{B,m) = max{ci/ci+i|l < i < 
k — 1}. Note that 1 -I- 1/m > q{B, m) > 1. Finally define r{B, m) = l/q{B, m). 

We show the following results for bin covering. 
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— For every finite set of bins B and integer m > 1, we give a deterministic 
algorithm with competitive ratio r{B, m). 

— For every finite set of bins B and integer m > 1, we give an upper bound of 
r(B,m) on the competitive ratio of any algorithm. 

These results show that r{B,m) is the best possible competitive ratio for the 
set of bins B, and the upper bound 1/m. This result reduces to the result of 
Woeginger and Zhang [11] for the case m = 1, and to the results of [3,8], if both 
m = 1 and i? = {1} hold. If only B = {1} holds, then it follows from our result 
that the best competitive ratio is ml{m + 1). 

We show the following results for vector covering. 

— An upper bound of on the competitive ratio for the vector cov- 

ering problem for arbitrary sets B. 

— An algorithm with competitive ratio 1/2 for the case where each bin in B is 
a unit vector. 

— An upper bound of 1/2 on the competitive ratio for this case 

i d—i 

— For the case of unit prefix vectors, (i.e. bins are of the form (1, . . . , 1, 0, . . . , 0), 
and \B\ = d) we show that this model is harder than the case where B 
consists of all unit vector bins (for large enough values of d) by giving an 
upper bound of 0(1/ log d) on the competitive ratio of any algorithm. 

— For the same case we show that the relation between models holds also for 
d = 2 by giving an upper bound of 4/9 on the competitive ratio of any 
algorithm for d = 2. 

— For the case where B is the set of all possible binary vectors, we show that 
this model is easier than the basic model of one bin type which is all 1 (for 
large enough values of d) by giving an algorithm of ratio 12(1/ log d). 

— For the same case we show that the relation between models holds also for 
d = 2 by giving an algorithm of ratio larger than 0.4. 

All upper bounds (negative results) in this paper hold both for determinis- 
tic and for randomized algorithms. In order to prove upper bounds we use an 
adaptation of Yao’s theorem [12] which states that an upper bound of a de- 
terministic algorithm against a fixed distribution on the input is also an upper 
bound for randomized algorithms. For maximization problems the bound is given 
by E (VonlVopt) or by E{Von) / E{Vopt) [4,5]. All upper bounds are proved for al- 
gorithms that are allowed to have additive constants. In order to prove upper 
bounds for such algorithms, we show that all upper bounds hold for arbitrarily 
long sequences. 

2 One-Dimensional Covering 

In this section we show that for every set of one-dimensional bins B and integer 
m > 1, the ratio r{B,m) is the best possible competitive ratio. 

Theorem 1. For every collection of bin sizes B, and parameter m, there exists 
an algorithm for bin covering with asymptotic worst case ratio r{B,m). This 
result is best possible among all on-line randomized algorithms. 
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We start with the algorithm. Later we prove the upper bound which shows 
its optimality. 

For a given set of bins B, and parameter m, we will use q for q{B, m), and r 
for r{B, to). Let t=\— log 2 m{q — 1)] . Define a partition of the interval of items 
(0, 1 /to] into the following subintervals: 

— For 1 < 7 < A: — 1 and 0 < Z < t — 1, let /, ; = (c,+i/2^, c,72^]. 

- Let4,i = (0,Cfc/2‘-ij. 

We assign each interval Ijj a corresponding bin size djj. We define dk,t to 
be bi = 1. For each other interval Ijj we define dj^k = The algorithm 

keeps one open bin of for each interval, this bin is used only for packing items in 
the interval. Each new item is classified and assigned to the corresponding bin. 
When a bin is covered, it is closed, and a new bin of the same size is opened and 
used for this interval. 

Lemma 1. The competitive ratio of the above algorithm is at least r 

Proof. We show that each covered bin is covered by at most q times its size. 
For every first type interval Ijg we show that its corresponding closed bins are 
covered by at most Cj/cj^i times their size. Let 6(cj+i) = ycj+i, then each such 
bin is covered by exactly 7/2* items of size at most Cjj^ , and the total size of 
the items that cover it is at most ycj. Thus the ratio between the size of items 
that cover the bin and the size of the bin is at most Cjjcj+i as required. Since 
q > Cjjcj+i, the bin is covered by at most q times its size. 

For the interval since all items are bounded by 1/(2 *to), a covered bin 
is covered by items with total size at most 1-|-1/(2 ‘to). Since 2* > l/(TO(g— 1)), 
1-1-1/ (2 *to) <q— \ + \ = q, and each such bin is covered by at most q, which 
is q times a size of a unit bin. 

The total number of bins that are never closed is 0{kf), which is 0{msf) 
where s = \B\, this number depends solely on B and to, and is an additive 
constant. 

Lemma 2. The competitive ratio of any deterministic or randomized algorithm 
for bin covering is at most r. 

Proof. We start with the deterministic case. Let j be an integer such that 
Cj/cj+i = q. Let y be an integer such that ycj = b{cj). Let be a large integer. 

In order to define a list of items, we define a set of sizes D. Let D = B U C 
where C = {xd\x G {1, . . . , 2to}, c' G C} fl [1/2 to, 1]. Enumerate the numbers in 
D, D = {di , . . . , dp} where d\ > d^ > . . . > dp. Now define e > 0 to be a number 
that satisfies 2m ■ 2^e < bg and for every 1 < j < _p — 1, 2m ■ 2^e < dj — dj+i. 
The list contains 2^y items with size e (sand) followed by 2 ^“®t/ items with size 
Cj — 27 for a fixed i, 0 < i < N . 

The optimal off-line assignment uses bins with size b{cj) to pack all items. 
Each bin contains y items with size Cj — 2*e and 2*7/ items with size e. This covers 
2^“* bins and Vopt = 2^~’’b{cj) = 2^~'^ycj. We change the packing of the on- 
line algorithm without reducing the value of the on-line algorithm in such a way 
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that the on-line algorithm will use only bins of weights Cj and Cj+i. Consider a 
bin of the on-line algorithm of size b which is covered, i.e. it contributes some 
amount to the on-line algorithm value. Since y < 2m, and b > bg > 2m2^ e, 
the bin contains at least one big item. Let a > 0 be the number of big items 
in the bin. Assume that a < 2m, otherwise remove items and let a = 2m, since 
items are larger than l/2m and the bin is of size at most 1, then the bin is still 
covered. 

Let /i be the total size of small items in the bin. If /i > 2'^ae replace the 
bin by a bins of size Cj containing each one big item, and [/x/(a£)J small items. 
Otherwise, partition the items in the same way, but use bins of weight Cj+i 
instead. 

We show that the new bins are covered and that the value of the on-line 
algorithm is not reduced, i.e. b < acj in the first case and b < acj+i in the 
second case. In the first case since y,/a> 2*£, the total weight of items in a new 
bin is (cj — 2*e) -|- [/x/(ae)Je > cj. Hence each new bin is covered. On the other 
hand b < acj -\- y2^e. According to the definition of D, acj G D. Assume to the 
contrary that b > acj. Since b G B then b G D, thus b — aCj > y2^e and hence 
the bin would not be covered. We conclude that b < acj, and the replacement of 
6 by a bins of size Cj does not change the on-line algorithm value and each new 
bin of size Cj is covered by at least cj. In the second case. If /r < 2^ae, the weight 
of an item in a new bin is at least Cj — 2*e > Cj+i . Hence each new bin is covered. 
On the other hand b < acj. Assume to the contrary that acj+i < b < acj, then 
6/a G C, contradicting the fact that Cj in the successor of Cj+i in C. We conclude 
that new bins are covered, the on-line algorithm value was not reduced, and now 
the on-line algorithm uses only bins of sizes Cj and c^+i. 

For 0 < i < A^, let Zi be the number of on-line algorithm bins with the 
amount of least 2*£ weight of sand but less than 2*+^£. Let be the number 
of on-line algorithm bins with weight of sand at least 2^ . We have — 

y2^ . The on-line algorithm value would be . . .+ZM)+Cjj^i{2^~'^y— 

(Zi Zjv)) = ycj+\2^ * -|- (cj — Cj+i)(Zi + Z„). 

We multiply the on-line and off-line algorithms values for every i > 1 by 
21 - 1 / 2 ^'^, value for f = 0 by 1/2^. For the on-line algorithm we get 

yCj+iiEh 2 ^- 2 - 72 ^ + 2^/2^) + (c, - c,+i) Z^2^2^ < y(c,+^(N + 

2)/2 + (cj — Cj+i)) = y/2((N + 2)cj_|_i -I- 2cj — 2cj_|_i) = y/2(Ncj^i + 2cj) 

. For the optimal off-line algorithm we get /2^ ■2^~'^ycj + l/2'2^ycj = 

(yc,/2^)(N + 2)2^-i = yCj(N + 2)/2 . 

If the competitive ratio is r and q = Ijr, then also for the convex sums 
yCj(N + 2)/2 < qy/2(Ncj+i + 2cj) or q > (N + 2)/((7V(cj+i/cj) -I- 2). Hence 
r = Cj+ifcj + S]s[, where 6^ is arbitrarily close to 0, for large enough values of 
N. 

For the randomized case we use the adaptation to Yao’s theorem, and the 
probabilities are the factors we multiplied by in the deterministic case. We get 
that E(Von)/E(Vopt) < Cj+i/cj. 
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3 Multi-dimensional Vector Covering 

In this section we consider variable sized vector covering. In the classical vector 
covering problem, B contains a single bin which is all 1. We study a simple 
extension of this model, and allow each bin to be a binary d-dimensional vector. 
We start by showing upper bounds on the competitive ratio for arbitrary sets 
B. The proof of the following Lemma is omitted. 

Lemma 3. The competitive ratio of any deterministic or randomized on-line 
algorithm for binary variable sized vector covering is at most 

Due to this negative result, we do not study the general model, but restrict 
ourselves to specific interesting cases of B. We show some relations between the 
different models. 

3.1 Covering a Single Type of Bin 

We start with an easy model where B consists of a single type of bin b. Assume 
that the number of non-zero components in b is i, then we can reduce this 
problem to the case of vector covering bins of dimension i, where all components 
are 1 (the basic vector covering problem). This is true since both the on-line 
and off-line algorithms do not use the coordinates which are zero in the vector 
b. Thus the best competitive ratio for this case is 0{l/i) [1]. 



3.2 Covering Unit Vector Bins 

In this model we let B consist of bins which have one non-zero component. We 
show that for any such set of bins, the best competitive ratio is 1/2. We consider 
this model since this is a case where all bins have the same bin weight. 

The upper bound (negative result) follows from the bound of [8] by picking 
one b G B, where the zth coordinate of b is non-zero, and giving a sequence 
where all vectors are d-dimensional, with all coordinates zero, except for the zth 
coordinate which is the same as in [8]. We can also give an easier proof for the 
case where \B\ > 1, we omit this proof due to space restrictions. 

Lemma 4. If B consists of unit vector bins only, and \B\ > 1, then the com- 
petitive ratio of any deterministic or randomized on-line algorithm is at most 
1 / 2 . 

We give an optimal greedy algorithm. We assume that \B\ = d and it contains 
all bins of unit bin weight. Otherwise we just cancel the coordinates that do not 
appear in any bin (are zero in all bins), since no algorithm can benefit from these 
coordinates, and treat the input as a lower dimension vector. 

The algorithm has at most d open bins at a time. Each arriving vector is 
classified to a type among {1, 2, . . . , d} according to its largest component. 

Denote a bin whose zth coordinate is 1 by 6^. After an arriving item is clas- 
sified to a type j, then if there is an open bin of size bj, the item is assigned to 
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it, and if after the item is assigned to a bin, the bin is covered, we close the bin 
(and never use it again). 

If no bin of size bj is open, open a new one and continue in the same way. 
Theorem 2. The competitive ratio of the greedy algorithm is at least 

Proof. For the optimal off-line algorithm, define by Sopt, the active area of items, 
that is, the area of items that was used to cover bins (i.e. the area that is not 
wasted, which is in the case of unit vectors, 1 for each covered bin). Clearly, 
Vopt = Sopt- Since all bins have only one non-zero coordinate, the active area of 
each item is in one coordinate. For an item a, let Umax = maxi<i<dai. Clearly, 
Sopt Si ®max) where A is the sequence of items. 

On the other hand, consider the area X)oe.4 ®max, the on-line algorithm 
packed according to the maximum component. 

Let at be the number of bins of type bi that the on-line algorithm managed 
to cover. For each such bin, the ith coordinate is covered by less than 2 (since 
at the time the last item was assigned there, the coordinate was less than 1 , and 
the new item adds at most 1). There is at most one open bin which is never 
closed, and its ith coordinate is also less than 1. Hence “max < -|- 1, 

where At are items that were classified to type i. Summing over all 1 < i < d we 
get: XaGA “max < 2Vo„ -f d, which gives Von > \Vopt - i- 

3.3 Covering Unit Prefix Bins 

We define a model which is similar to the previous one. Let Bi be a set that 
contains exactly d possible bin sizes. Bin i, which also has bin weight i, is a 
d-dimensional vector whose i leftmost coordinates are 1, and all others are 0. 
We show that using this set of allowed bins instead of d unit vector bins, makes 
the problem of on-line covering harder. 

Theorem 3. The competitive ratio of any deterministic or randomized on-line 
algorithm for covering the increasing set of bins has competitive ratio of at most 
0(1/ log d). 

Proof. The upper bound sequence consists of 2k items with size a = (1 — 
e, 0,0, ... ,0) and then with probability pi (which we fix later), 2k items with 
size (e, 1, . . . , 1, 0, . . . , 0) arrive {d — i zeros), and for 2 < i < d, with probability 
Qi, (which is also fixed later), k items with size (0, 1, . . . , 1, 0, . . . , 0) arrive {d — i 
zeros). qi is the probability that no further items (except the first 2k items) 
arrive. Hence the probabilities should be fixed so that Xfcife + di) = 1- 

Let at be the number of bins of bin weight i that the on-line algorithm opens 
for one item of size a, and j3i to be the number of bins of bin weight i it opens, 
assigning two items with size a to each. 

Let us calculate the competitive ratio for each case: Consider the cases where 
items with d — i rightmost zero coordinates arrive. With probability pi, the on- 
line algorithm manages to cover all bins of weight of at most i that it opened. 

The optimal off-line assignment would use only bins of bin weight i, and 
would assign one item of each type to every bin. Thus Copt = 2ki, and Con = 
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+ /^i) ■ j- the other hand, with probability qi, the on-line algorithm 
would cover only bins with at least two items with size a (otherwise the leftmost 
component is not covered). 

The optimal off-line assignment would use only bins of bin weight i, and 
would assign two items with size a, and one item of the other type to each 
bin. Thus Copt = ki and Con = X)j=i ’ J- According to the initial on-line 

assignment, we have + 2/?i) < 2fc. Let Pi = qt = i < d and 



Pd = qd= 2d- 

The expectation of the optimal off-line value is E{Vopt) = + 

q,ki) = 2 (i^ + 5) == ^ + l) = i^(Eti i) > \khid. 

It is possible to show that the expectation of the on-line value satisfies E{Von) < 



k. The probabilities are valid since = Ef=i i(i+i) 7 

- 1-2 = 1. Thus the competitive ratio is at most ^ ^ _ 2fc 



Eti (' 






In d 



= o 



(logd) ■ 



We show that this problem is significantly harder than unit vector bins, not 
only asymptotically but for d = 2 as well (in this case B = {(1, 1), (1, 0)}). 
The Lemma follows from a more careful examination of the general sequence for 
d = 2. The full proof is omitted. 



Lemma 5. The competitive ratio of any deterministic or randomized algorithm 
for d = 2 and unit prefix bins is at most 4/9. 



3.4 Covering All Possible Bins 

In this model, B consists of 2*^ — 1 bins, i.e. each binary d-dimensional bin may 
be used. We show that the basic covering model (the only allowed bin is the “all 
1” bin) is harder than this model. We define the following algorithm. 

Let k = |"log 2 (d^)]. We define type vectors by vectors in the set A‘^ where 
A = {0} U I 0 < i < A:}. consists of {k + 2^ different type vectors. The 
algorithm keeps one open bin of a corresponding size for each vector type. To 
determine the bin vector for a certain type vector we do the following: 

For each 0 < s < A:, let as be the sum of sizes of all components of size ^ of the 
type vector, which is p, where m is the number of p components. Let Si be an 
integer for which as is maximized. The corresponding bin would have zeros in all 
coordinates which are less than pj- in the type vector, and 1 in all coordinates 
which are at least pj-. 

We keep one open corresponding bin for each possible type vector. On arrival 
of an item vector a, a is identified with a type vector, then it is assigned to a 
bin that corresponds to this type. A covered bin is closed and replaced by a new 
bin of the same size. 

For each arriving item vector a, compute its type vector in the following way: 
Let i be the coordinate such that at = Let a' be the vector p. 

Replace each coordinate a' by the maximum number A, such that a' > A, jf 
s > k, then replace a' by zero. The new vector a" is the type vector of a. Let 
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o!" = a" ■ ai- a is assigned to the open bin which corresponds to the type vector 
a”. 

Theorem 4. The above algorithm has competitive ratio 

The proof of the Theorem 4 is omitted. It is also possible to design an algorithm 
for d = 2, and prove the following theorem. 

Theorem 5. There exists an algorithm for d = 2, for covering with all possible 
bins, which has competitive ratio strictly above 0.4. The algorithm divides items 
into ranges, and packs each range separately. 
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Abstract. We consider a general methodology proposed by Chen and 
Kao [4] for testing polynomial identities. We prove that the test cannot be 
completely derandomized by any specified set of rational approximations 
to algebraic numbers up to a polynomial number of bits. The proof is a 
direct application of Dirichlet’s box principle. We also give some number 
theoretic estimates for the likelihood of a multiplicatively independent 
sequence of integers which can be used in their algorithm. 



1 Introduction 

Randomization has been used very successfully in the field of algorithm design 
and complexity analysis. The first widely acclaimed use of randomization as 
a computational resource with provable polynomial time performance bounds 
can be traced to Solovay and Strassen [13] and to Rabin [11] following work by 
G. Miller [8]. These results were for testing primes. Another early work using 
randomness to achieve provable bounds in computation is the Schwartz-Zippel 
Theorem around 1978 [12] [14]: 

Theorem 1 (Schwartz-Zippel). Let F{xi, . . . ,Xn) G F[xi, . . . ,Xn] he a mul- 
tivariate polynomial of total degree d, where F is any fixed field. Suppose F is 
not identieally zero. Then, for any fixed finite set S CF, 

Pr[E(ri,...,r„) = 0] < 

where ri, . . . , r„ are chosen independently and uniformly at random from S. 

This elegant result is used to test for polynomial identities. Thus, if we wish 
to test for /(x) = g(x), we simply pick a set of random inputs r and check 
that /(r) = 5 (r), with the understanding that if /(x) = gfx), then certainly 
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/(r) = g{r) for any r, and, if /(x) yf ff(x), then /(r) yf ^(r) with high probability. 
The error probability is only on one side, and can be reduced by either choosing 
a larger set S, or, as a more general principle, by running the probabilistic test 
many times. There are more advanced techniques for amplification (see [9,15]). 

The Schwartz-Zippel method is a general technique for testing polynomial 
identities. Not only it can be used in straightforward algebraic settings, but 
also it can be used to give elegant probabilistic algorithms for combinatorial 
problems. A well known application is the probabilistic NC^ algorithm for the 
perfect matching problem by Lovasz [7] . 

A perfect matching in a graph G is a set M of edges, such that every vertex of 
G is incident to exactly one edge in M. The perfect matching problem is to decide 
if a given graph has a perfect matching. Tutte gave a beautiful characterization 
of the existence of perfect matchings in a graph G. Define the Tutte matrix T 
as follows. Suppose G has n vertices and m edges. Let {xij} be a set of m 
indeterminants, one for each (undirected) edge (i,j)- For any edge (*,j), with 
i < j, put an Xij in the (i,j) position and —Xij in the (j,i) position, and 0 
elsewhere. 

Theorem 2 (Tutte). G has a perfect matching if and only if detT yf 0. 

Then Lovasz’s RNC^ algorithm for the perfect matching problem is to apply the 
probabilistic test for polynomial identity and test for zero. More specifically, his 
algorithm assigns to each indeterminant a uniformly and independently chosen 
integer in the range {!,..., 2n}. Note that the total degree of det T is (at most) 
n, and thus the algorithm achieves success probability 1/2 with m ■ [log 2 ( 2 n)] 
random bits. 

There are other randomized fast parallel algorithms for the perfect match- 
ing problem. For instance, Chari, Rohatgi and Srinivasan [3] gave an alternative 
algorithm that uses fewer random bits, but the processor count is a large poly- 
nomial in n (it is estimated that the algorithm by Chari et al needs to compute 
determinant and inverse of n x n matrices of entries of size up to see [4].) We 
note that it has been a long standing open problem to achieve a deterministic 
NC^ algorithm for the perfect matching problem. 

Recently Chen and Kao [4] proposed a new general methodology for testing 
whether a polynomial with integer coefficients is identically zero. Their method is 
based on a theorem on the non-vanishing of a polynomial on algebraic numbers. 
Their motivation is to derandomize the RNC^ algorithm for the perfect matching 
problem. The general idea of their approach is that, instead of evaluating the 
polynomial at random integer inputs, it can be evaluated at various algebraic 
numbers. 

More specifically, let pi = 2,p2, . . . ,Pn be the smallest n primes, and let 
f{xi, . . . ,Xn) be a non-zero multilinear polynomial with integer coefficients. 
Then 

/(±v^> ±v^> • ■ • ^ 0. 

More generally, let /(xi, . . . , x„) have degree di on Xj. Let ki = [log 2 (di-l-l)] , and 
s = ki- Let qip , . . . , gpfcj , . . . , g„q, . . . , qn,k„ be any s square-free positive 
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integers which are pair-wise relatively prime, and let 61 ^ 1 , 
bn,k„ be any s bits from {0, 1}. Then / is non-zero if and only if 

( fel fc2 fcn \ 

i=i j=i i=i / 

This does give a deterministic test, albeit one which involves irrational num- 
bers, and thus its exact computation is computationally infeasible. 

To overcome this difficulty, Chen and Kao used a certain random combination 
of rational approximations to the irrational algebraic numbers. Indeed, in the 
above deterministic test there is no need for the s bits bij at all; these were 
included for the purpose of using randomization. More specifically, they used 
a random choice of the s bits, with a truncation of the quadratic irrational 
numbers to a prespecified polynomial number of bits £. They showed that to 
achieve good success probability, £ = |"log 2 (c -I- cd)~\ + c?|"log 2 n] -I- 2 d[log 2 s] -I- 
d -I- 1 suffices, where d is the total degree of / and c is its maximum absolute 
value of coefficients. In particular, they suggest using simply the truncations 
of ±y^, . . . , for multilinear polynomials, where the signs ± are 

randomly chosen. 

With this method, they arrived at their randomized NC^ algorithm for the 
perfect matching problem. It turns out that the number of random bits used 
is only X^r=i max{l, rii}] , where rii is the number of neighbors Vj of the 
i-th vertex Vi in G such that j > i. This allowed them to achieve an RNC^ 
algorithm that uses fewer random bits without doing more work than all previous 
randomized NC algorithms for the perfect matching problem. They also used 
their general methodology for several other problems. 

In this paper we show that the use of prime square roots \/p 2 , . . . , y/Pn 
is unnecessarily restrictive. Even if we use square-free positive integers which 
are pair-wise relatively prime, it is still unnecessarily restrictive. Instead we only 
need numbers that are multiplicatively independent modulo squares. Note that 
it is computationally easy to test for this property of multiplicative indepen- 
dence modulo squares [1]. Moreover, for polynomially many random integers, it 
is exponentially unlikely that they are pair-wise relatively prime. On the other 
hand, it is superpolynomially unlikely that they are multiplicatively dependent 
modulo squares. We prove these facts in Section 3. 

While their algorithm reduces the number of random bits required, the pos- 
sibility remains that perhaps one might be able to eliminate randomness alto- 
gether, by choosing carefully a specific set of signs in the rational combination of 
irrational numbers and a sufficiently good rational approximation to each of the 
irrational values, and then evaluating. As their methodology is a general one, the 
general question then is whether there are such universal choices. The authors of 
[4] consider this the most intriguing problem left open from their approach [5]. 
In Section 4 we show that this is in general impossible. 
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2 Algebraic Preliminaries 

Let Q be the field of rational numbers. Let Q* be the multiplicative group of 
Q, and Q*^ be the subgroup of squares, i.e., Q*^ is the image of x from 

Q*. By the unique factorization of integers, the quotient group R = Q*/Q*^ 
is a direct sum of countably many copies of Z 2 generated by po = and by 
Pi = 2, . . . ,p„, . . ., respectively, where p„ is the n-th prime, each of which has 
order 2 in R. 

Let /3 i,/32, • ■ • ,Pm be any m multiplicatively independent elements in R, i.e., 
for any non-empty S C {!,..., m}, JlieS A 7^ 1 in R- Results of the following 
type are generally referred to as Kummer theory (see Lang [6] Chapter VIII 
Section 8). However we present the following elementary proof. 

Lemma 1. Let Qm = Q(v73i> vTi, • ■ • , ^fK)- Then [Q™ : Q] = 2™. 

Proof. Write each (3j as a 0-1 vector of the exponents mod 2 over primes as 
follows. By unique factorization, (3j = Wi^^P^'T"^ ^ where G Q*^ and e* mod 2 
are uniquely defined. (Here po = — 1 and pt,i > 1 are all the primes.) The 
uniqueness claim is just the elementary statement that for distinct primes pi, 
±Y\pi yf 1 is not a square in Q*. Then the 0-1 vector for j3j is (eo,ei,...). 
It is actually a finite vector, one entry for every prime appearing in (3f, or one 
can think of it as an infinite 0-1 vector, with all but finitely many entires 0. 
This establishes an isomorphism between R and 0 Z 2 . Now one can perform 
Gaussian elimination on the matrix over Z 2 whose rows are these 0-1 vectors 
for /?i, (32 , . • . , (3m- (In matrix terms, the elementary row/column operations are: 
exchange of two rows or columns, adding mod 2 of one row to another.) Being 
multiplicatively independent is equivalent to the rank of this matrix being m over 
Z 2 . We can arrive at a normal form after Gaussian elimination [/^ B], where 
Im is the m X m identity matrix and B is some matrix over Z 2 with m rows. 
If «i, Q! 2 j • ■ • , C(m are the elements corresponding to the rows, then there are m 
distinct 91 , ( 72 , ■ • • , 9m G {Po,Pi, ■ ■ ■}, such that aj = qjVj, where every rj is a 
product of Pi’s other than 91 , 92 , • ■ • , 9m- In terms of field extension: 

Qm ~ Q('\Ah; y/Rf., • • • , yfRTfff). 

Denote Qj = Q(i/«i, ■ ■ ■ > V^)- 

Clearly [Q^ : Q] < 2™ as [Qj : Qj-i] < [Q(^aj) : Q] = 2. There is 
an injective homomorphism from the Galois group G = Gal(Qm/Q) to the 
permutation group which maps yWJ to iyddj, for every j. The full permutation 
group is the dual group of Z™ which has order 2’”. We want to show that this 
homomorphism is surjective, and thus G is isomorphic to the dual of Z™ and 
in particular has order 2*” as well. Being the splitting field of 
Qm is Galois over Q, and thus by the fundamental Galois correspondence [Qm : 
Q] = |G| =2"^. 

To show that |G| = 2™, we represent each cr G G as a 0-1 vector 
by the action on {y/aj}, a{^a(j) = Since each aj has a distinct 

Qj, we have the following important property of G: For any non-empty subset 
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of indices I C {1, . . . ,m}, let aj = then there is an automorphism 

T G Gal(Q(i/o7)/Q) which maps ^/aj to — yd/. This r extends to some a in 
G. In terms of the 0-1 vectors (crj), for every non-empty subset / of {1, , m}, 
there is some (cTj), such that t mod 2. 

We claim that the 0-1 vectors obtained from G include m unit vectors where 
the f-th unit vector has its i-th entry 1 and 0 elsewhere. Inductively, we assume 
there are vh), 1 < i < j, whose restriction to the first j coordinates are the first 
j unit vectors. The base case is established by taking I = {1}. For j + 1, take 
/ = {/ I 1 < / < j, = 1} U {j -1-1} C {1,2, .. . , j -I- 1}. Then there is a u 
from G, such that = 1 mod 2. Define 



then satisfies our requirement. 

This is fairly clear. Formally, starting with k < j, 

^ Vk'’ = Ufc + l\uj=i, i=k] = 0 mod 2. 

For coordinate j -I- 1, 

(adding zero terms) 

k£l 

H (ufe -b 4'"^ + 4+i) + “j+i 

k:l<k<j,kGlfUk—l 

k^I 

= 1 mod 2. 

To complete the induction, we replace by -|- for i G I,i ^ j- bl 

We actually established more, namely we computed the Galois group G = 
Gal(Qm/Q). Moreover, reverting back to the multiplicatively independent set 
|/3i, . . . , /?„}, for any j, the proof gives a permutation a G G which flips to 
— but fixes all others. These generate G as a free abelian group of exponent 
2. Hence we have 

Corollary 1 . For any sequence 01,02, . . . ,Om G Q*, Q(X) 4 i = Qm. In 

particular this is true for any sign sequences Oi = ±1. 

Proof. If o-iy/Pl) were a proper subfield of Q^, there would be a non- 
trivial automorphism of G which fixes But a non-trivial automor- 

phism must flip some y/]Ti, leading to a non-trivial linear relation 2ai^/JTi = 0, 
where 5^0. Knowing the structure of G, this is obviously impossible as we can 
apply some a which flips one \fpTo where to G S' and fixes all others, and get 

\/fe=0. □ 
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Let f{xi, . . . , Xn) G Q[a:i, . . . , x„] have degree di on Xi. Let ki = |"log2(di + 
1)] , and s = Let gi,i, . . . , qi^ki ,■■■, qn,i, • • ■ , qn,k„ be any s multiplica- 

tively independent elements of i?=Q*/Q*^, and . . . ,ai^ki, ■ ■ ■ ,an,i, ■ ■ ■ ,cin,k„ 
be any s elements of Q*. Denote by qi = X/j=i then 

Corollary 2. WehaveCl{qi, . . . ,g„)=Q(ygTT, ■ • ■ 
and / is non- zero if and only z/ /(gi, . . . , g„) yf 0. 

Proof. The first equality follows from Corollary 1 by noting the following sand- 
wiched containment 

Q( v^9l,l ) ■ ■ ■ ) v^9l,fei ) • ■ • 5 9n,l ) ■ ■ ■ ) dn,k„ ) 

n 

= 

^ Q(9l5 ■ ■ ■ 5 Qn) 

^ Q('\/Qi,1’ ■ • ■ t • 1 y/Qn,li ■ • ■ i y/ Qn,kn ) ■ 

Hence for all j > 1 

[Qidi, ■■■,<!]) ■ Q{qi,---,dj-i)] = 2 '^L 

The second claim follows from this by a simple induction. □ 

The algorithm by Chen and Kao [4] uses the special case of Corollary 2 on 
pair-wise relatively prime integers to arrive at the test for polynomial identities. 
With Corollary 2, we have generalized their algorithm to be used with any 
multiplicatively independent rationals or integers. In the next section we estimate 
the probability that polynomially many random integers are multiplicatively 
dependent modulo squares. We show that this probability is superpolynomially 
small. Thus most such sequences of integers are multiplicatively independent. 
On the other hand, the probability that polynomially many random integers are 
pair-wise relatively prime is exponentially small. 



3 The Probability of Multiplicative Dependence 



In this section we consider various probabilities related to multiplicative depen- 
dence. 

We will first show that the probability that a multiplicative dependence mod- 
ulo squares among polynomially many random integers is extremely small. For 
this result we need the function L{N) := exp(i/log N log log N) . Pomerance [10] 
showed that if we choose random positive integers , . . . , < N, then (from 

our point of view) 



Pr[deg Q(yft, . . . , < 2”] 



0, if n< 

1, ifn>L(iV)^+C 



His bound for the probability in the first case is 0(l/i/IoglV), which is not sharp 
enough. We are, however, interested in much smaller n relative to N , and can 
simplify his argument and still get a useful bound. 
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Theorem 3. Fix d > 0. Let qi,q 2 , ■ ■ ■ ,qn be chosen independently from the 
uniform distribution on {1, , N}, with n < (log N^. The probability that there 
is a nonempty S C {1, . . . , n} and an integer r such that Hies T ~ fnost 

Proof. We will call a number i?-smooth if all its prime factors are less than or 

equal to B. If a nontrivial product from <71 , . . . , is a square, then at least one 
of the following two events must occur. Either for some prime p > B, we have 
p^\qt for some i or p\qi, qj for some i < j (call this C), or one of gi, . . . , is i3- 

smooth (call this S). We have Pr[C] < X)p>B + (2) /P^) — (”2 AlsOj 

Pr[S'] < , where f){N,B) counts the S-smooth numbers < N. From [2] 

we know that when 

log N log N 

u := < , 

log B log log N 

we have 'tp{N,B)/N < e~“^°s«(i+o(i))^ uniformly as m — >■ 00. The choice of B 
involves a tradeoff that is familiar from the analysis of factorization algorithms: 
we would like B large so as to make C unlikely, but we would also like it small 
so as to prevent S. A good choice is 



f llogNloglogN 

^ = “P(V 2 

which makes the probabilities roughly equal. Then it is straightforward to prove 
that Pr[C] and Pr[S'] < from which the result follows. □ 

Note that the bound is asymptotically much less than inverse 

exponential polylog in input size log N, 

Corollary 3. For any c > 0, under the hypotheses of the last theorem, we have 
Pr[degQ(vd?T, ...,y<p0 < 2”] = o(l/exp((loglog 



I 21ogiV 
log log N ’ 



as N ^ 00 . 



The probability that qi, ... ,q„ are pairwise relatively prime is also of interest. 
It is easy to see that the probability of this is exponentially small, for example 
by considering adjacent pairs. Below we give a bound with explicit dependence 
on the parameters. 

Theorem 4. Let gi, . . . , be chosen independently from the uniform distribu- 
tion on {!,..., IV}. The probability that these numbers are pairwise relatively 
prime is at most (1 + n)2“” + n/N. 

Proof. For the qi to be pairwise relatively prime, it is necessary and sufficient 
that this condition hold locally for each p. To get our estimate we use only 
p = 2. First assume that N is even. The residues of the qi mod 2 are a random 
sequence of bits, of which at most one can be 0. Using the binomial distribution. 
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the probability of this event is 2“” + n2“" = (1 + n)2“". If N is odd, either all 
Qi are < N — 1, for which the previous case applies, or some Qi equals N. The 
chance of this last event is at most n/N. □ 



One can get better results by considering more primes. 



Theorem 5. For n fixed and N — >■ oo, the probability of the last theorem has 
the limit 



Pin) = n 

p 





Proof. Let be the event that qi, . . . ,q„ are pairwise relatively prime. Let Gp 
(“good at p”) be the event that no pair has a gcd divisible by p, and Bp (“bad 
at p”) its complement. Then 



nc. 


- ^ Fr[Bp] < Ft[En] 


< Pr 


1 

Z) 
1 


p<tc 


p>x 




[p<a: J 



We have Pr|Sp] < hi, so 



(;) 



Px(n) < liminf Pr[ifAr] < limsupPr[ifAr] < Pxin), 

X Af-hoo AT-hoo 



where 



Px{n) = n 

p<x 



1 + 



p- 1 




By the binomial theorem, each factor of Px is l + 0(p ^), so lima,_>oo Px = P{n), 
which implies the theorem. □ 



4 The Lower Bound 

In this section we prove that for any choice of s multiplicatively independent qij’s 
of polynomially many bits, where 1 < i < n, I < j < = |"log 2 ((ii + 1 ) 1 , s = 

Xir=i prespecified choice of signs in the sums qi = XljLi (“1)^*”’ 

and any polynomial bit precision L, there are non-zero polynomials f{xi , . . . , Xn), 
where the degree of Xi is at most di and the coefficients are integral and all 
bounded in n, such that while f{qi , . . . , q-n) ^ 0, for all £ < L, fi[qi]e , . . . , [qn]e) = 
0, where [qi\i denotes the ^-bit truncation of qi. This provides a negative answer 
to the open question from Chen and Kao [4]. We can even take / multilinear 
and all coefficients from {—1,0, 1}. 

In fact, a much stronger result can be shown where L, the bit precision, can 
be substantially increased from polynomial in n to almost 2”, yet still some 
non-zero multilinear polynomial / with coefficients from {—1,0,1} can vanish 
on all truncations up to L bits. Such a polynomial can even simultaneously 
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vanish on all L-bit truncations on a large (almost 2") set of arbitrarily chosen 
initial irrational points. The proof is a direct adaptation of Dirichlet’s method 
showing the existence of simultaneous rational approximations with bounded 
denominators. 

Theorem 6. Given any n and any degree sequence d\, . . . ,dn, let D = UtM+ 
1). LetC,L,Q be arbitrary positive integers, such that (2C+1)^ > (2^'^^CDQ)^ . 
Let qi^i , . . . , qi^ki ) ■ ■ • j > Qn,k„ be any integers multiplicatively independent 

modulo squares, where ki = [log 2 (di + 1)]. Let qi = where Oij 

are any non-zero integers. Assume \qi\'^' < Q. Then, there are non-zero poly- 
nomials f{xi, . . . ,Xn) with integral coefficients bounded by C, and degree in Xi 
bounded by di, such that f{[qi]i, ■ ■ ■ , [qn)]e) ^ 0, /or all I < £ < L, where [qi]i 
denotes the £-bit truncation of qi. 

Corollary 4. Lf 2C = 2‘^, Q = 2'^, and 2” > L{L + c + g + n)/c, then there are 
non-zero multilinear polynomials satisfying the above condition. 



Corollary 5. Lf 2” > tL{L + g + n + 1), then there are non-zero multilinear 
polynomials with coefficients from {—1, 0, 1} that vanish on all L-bit truncations 
on a set of t arbitrarily chosen initial irrational points q^^l ,q^‘^\ . . . , q^*^\ where 
each q^‘‘l = {q[''\ q^\ • ■ • , qn'^ has the form given in Theorem 6. 

Proof. For any f where is a sequence of integers, with 

^ ji ^ di, 1 < i < n and ICj,,... I < C, define a polynomial by 

f^{xi,...,Xn)= ^ 

0<ji<di,l<i<n 

The evaluations of / at all £-bit truncations of qi defines a (linear) map from 
[-C, C]^ to {-CDQ, CDQffi, by / {f^{[qi]e, • ■ • , [qn]i), ^<£< L). Note that 

there are at most D non-zero terms in and each term is strictly less than CQ, 
being strict since qi is irrational. 

Now divide the range into 2^^^ CDQ equal parts along every dimension. More 
precisely, for each dimension 1 <i < L, divide the interval range {—CDQ, CDQ) 
into 2^~^^CDQ equal length subintervals, i.e., for I < ji < 2^^^ CDQ the 
jith subinterval is {—CDQ -\- {ji — l)/2'^, —CDQ -\- ji/2^] and for the last one 
ji = 2^^^CDQ, the subinterval is {CDQ — 1/2^, CDQ). In this way, we obtain 
{2^^^CDQ)^ cells each congruent to (0, (except on each dimension the last 
one has open boundaries on both sides). 

Since (2C-b 1)^ > {2^+^CDQ)^, some two images fall in the same cell, i.e., 
there are two distinct tuples / yf such that 

Ifdidih^ ■ • • > kn]e) - h'{[qi]e, ■■■, [qn]i)\ < 



for all 1 < £ < L. 
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However, each quantity is a rational number with at most L bits after the dec- 
imal point. In other words, both . . . , and . . . , 

are integers. It follows that fi{[qi]i, ■■■, [qn]e) ~ , [qn]e) = 0, for all 

1 < i < L. Now take then the non-zero polynomial 

fz has fz{[qi]i, [qn\e) = 0, for all 1 < .^ < L. □ 
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Abstract. We present an 0(n log^ n)-time randomized algorithm for 
gossiping in radio networks with nnknown topology. This is the hrst 
algorithm for gossiping in this model whose running time is only a poly- 
logarithmic factor away from the optimnm. The fastest previously known 
(deterministic) algorithm for this problem works in time log^ n). 



1 Introduction 

The two classical problems of disseminating information in computer networks 
are broadcasting and gossiping. In broadcasting, we want to distribute a given 
message from a distinguished source node to all other nodes in the network. 
In gossiping, each node v in the network initially contains a message nriv, and 
we wish to distribute each message to all nodes in the network. In both 
problems, we would like to minimize the time needed to complete the task. 

In radio networks, a message transmitted by a processor is sent to all pro- 
cessors within its range. The range relation is represented by a graph of nodes 
with directed edges between them. All processors work synchronously, and if 
a processor u transmits a message m at time step t, the message reaches each 
neighbor u of u at the same time step. Node v will successfully receive m only if 
u is the only processor, among those whose range contains v, that transmits at 
time t. Since the communication links are uni-directional, there is no feedback 
mechanism in the network (see, for example, [20]), and thus, in general, a node 
does not know for certain whether its transmissions were successful. Further, 
we assume that collisions cannot be resolved nor detected, that is, if messages 
from two or more processors reach v at time t, v does not get any message and 
it does not know that the collision occurred. This is motivated by situations 
when message collisions are difficult to distinguish from background noise, in 
which protocols that do not depend on the accuracy of the collision detection 
mechanism (see [13,12]) will be more reliable. 

We focus on gossiping algorithms that do not use any information about 
network topology. Such topology-independent algorithms are useful in networks 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 483-492, 2001. 
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with mobile users or unstable topologies, since then one does not need to change 
or reconfigure the protocol after topology changes. As long as the network is 
strongly connected and no changes occur during the actual execution of the al- 
gorithm, the task of gossiping will complete successfully. The strong connectivity 
assumption is necessary for gossiping to be meaningful. 

Past work. Most of the previous work on radio networks focussed on broad- 
casting. If the topology of the network is known to all processors, Gaber and 
Mansour [14] showed that broadcasting can be achieved in time 0{D + log^n), 
where D is the network diameter. Diks et al [10] gave efficient broadcasting al- 
gorithms for special types of known networks. It is also known that computing 
an optimal broadcast schedule for a given network is NP-hard, even for points 
in the plane, where the graph is induced by node ranges, see [8,26]. 

For networks with unknown topology. Bar- Yehuda et al [3] gave a randomized 
algorithm that achieves broadcast in expected time 0(Z?logn -I- log^ n). This is 
very close to the lower bound of f2{Dlog{n/D)), by Kushilevitz and Mansour 
[18], and it matches this lower bound for a wide range of depth values, for 
example when D = for any e > 0. Further, if Z? is a constant, it also 

matches the lower bound of l7(log^ n) for constant diameter networks, obtained 
by Alon et al [1]. 

In the deterministic case. Bar- Yehuda et al [3] gave an f2(n) lower bound 
for constant diameter networks. For general networks, the best currently known 
lower bound of l7(nlogn) was obtained by Bruschi and del Pinto [5], and, in- 
dependently by Chlebus et al [6]. In [6], the authors also present a broadcast 
algorithm with time complexity - the first sub-quadratic upper bound. 

This upper bound was later improved to 0(n®/^log^n) by De Marco and Pelc 
[11]. Chlebus et al [7] developed several broadcasting algorithms, including one 
with time complexity In an independent work, using a probabilistic 

construction, Peleg [23] gave an 0{n^^‘^^logn) upper bound. Recently, Chrobak, 
Gasieniec and Rytter [9] presented a deterministic algorithm for broadcasting 
with time complexity O(nlog^n), thus nearly matching the lower bound of 
l7(nlogn) from [5,6]. As in [23], this broadcasting algorithm was constructed 
using a probabilistic argument. 

The problem of gossiping has been intensely studied in various network mod- 
els (see, for example, [15]). For radio networks, Ravishankar and Singh [24,25] 
studied gossiping algorithms for some restricted topologies, including paths and 
rings, under probabilistic assumptions on the spatial distribution of nodes. In 
our previous work, [9], we developed a deterministic algorithm for gossiping 
with time complexity log^ n), which, to our knowledge, is the only sub- 

quadratic algorithm for gossiping in radio networks with unknown topology. 

Our results. In this paper we give a randomized 0(n log"' n)-time algorithm 
for gossiping in radio networks with unknown topology. Our basic algorithm 
is Monte Carlo and it has the following performance characteristics: for any 
0 < e < 1, in time 0(nlog^ nlog(n/e)) it achieves broadcast with probability at 
least 1 — e. This easily yields a Las Vegas algorithm with expected running time 
0(nlog^ n). 




A Randomized Algorithm for Gossiping in Radio Networks 485 



2 Preliminaries 

Radio networks. A radio network (see [3,7]) is defined as an n-node directed 
graph whose nodes are assigned unique identifiers from the set {l,2,...,n}. 
Throughout the paper, for gossiping to be meaningful, we assume that the net- 
work is strongly connected. If there is an edge from u to v, then we say that v 
is an out-neighbor of u and u is an in-neighbor of v. 

Initially, each node v contains a message and has no other information. 
The time is divided into discrete time steps. All nodes start simultaneously, 
have access to a common clock, and work synchronously. (As noted by Peleg 
[23], the assumption about a common clock is not necessary.) At any time step, 
a node can be in one of two states: the receiving state or the transmitting state. 
A gossiping algorithm is a protocol that for each identifier id and for each time 
step t, given all past messages received by id, specifies the state of id at time 
t. If id transmits at time t, the protocol specifies the message. A message m 
transmitted at time t from a node u is sent instantly to all its out-neighbors. 
However, an out-neighbor u of u receives m at time step t only if v is in the 
receiving state and if no collision occurred, that is, if the other in-neighbors of 
V do not transmit at time t at all. Further, collisions cannot be distinguished 
from background noise. If v does not receive any message at time t, it knows 
that either none of its in-neighbors transmitted at time t, or that at least two 
did, but it does not know which of these two events occurred. 

The running time of a gossiping algorithm is the smallest t such that for any 
strongly connected network topology, and for any assignment of identifiers to 
the nodes, each node receives all messages no later than at step t. 
Simplifying assumptions. For clarity of presentation, we will present our al- 
gorithms as if the nodes knew n, the size of the network. This assumption can 
be eliminated by a standard doubling technique (see [6,7]) that works as follows: 
We organize the computation into phases, and we modify a given algorithm so 
that in phase i only nodes with labels at most 2® participate in the algorithm. 
This does not change the asymptotic running time. 

Further, we will also assume throughout the paper that n is a power of 2. 
For other n, the processors can execute the algorithm for the nearest power of 2 
larger than n, without changing the asymptotic running time. 

Notation. By V we denote the set of nodes, and individual nodes are denoted 

by letters u,v, Messages are denoted by letter to, possibly with indices. The 

message originating from a node v is denoted by to„. The whole set of initial 
messages is M = {rriy : v & V}. During the computation, each node v will store 
a set of messages that have been received by v so far. Initially, My = {to„}. 
Without loss of generality, whenever a node is in the transmit mode, we can 
assume that it transmits the whole My. This is achieved by a procedure denoted 
transmit(My). Procedure receive{) puts v in the receive mode and it returns the 
received message, or null if no message has been received. 
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3 Limited Broadcast 

One component of our algorithm is a procedure for limited broadcasting. Given 
an integer k and a node v, the goal of limited broadcasting is to send the message 
My to at least k nodes in the network. We refer to v as the source node or the 
node that initiates the broadcast, and to My as the source message. 

In [9], the broadcasting algorithm is defined by a sequence S = SqSi . . . 
of subsets of {1,2 ,..., n}. At time t, any node v that has already received the 
source message checks whether n G S'* . If so, v transmits the message, otherwise 
V is quiet. We modify the algorithm from [9], so that it performs the broad- 
casting procedure for only O(fclog^n) steps. Below, we appropriately refine the 
correctness proof, since the proof from [9] is not sufficient for our purpose. 

The pseudo-code for the algorithm executed by each node is given below. 
Each node v has a boolean flag activcy that indicates whether v is active, that 
is, whether v has received a source message. Each iteration of the for-loop lasts 
one time step. The value of constant 7 will be determined later. 

Procedure LtdBroadcast„(A:). 

for T = 0, 1, . . . , 7/c log^ n — 1 do 

if activCy and v € Sr then transmit{My) 

else 

m-<r- receive{); 
if TO yf null then 

My ^ My U {to} ; 

activCy true 



Lemma 1. Assume that initially exactly one node u is active and that all nodes 
V begin executing LtdBroadcast„(A:) simultaneously. Then, for some constant 
7 independent ofk, after the computation is complete at least k nodes will receive 
the message from u. 

Proof. Without loss of generality, assume n > 2. In [9], it was shown that 
there is a constant a such that for each j = 0 , . . . , log n there is a family 
Sj = {Sjfi, Sj^i , . . . , Sj^aj-i) of aj = a2^ logn sets with the following property: 
(*) For any two disjoint sets X, Y with 2-^“^ < |A| < 2^ and |F| < 2^ 
there exists a set S'jy in Sj such that \X fl Sj^i\ = 1 and Y fl Sj^i = 0. 

Let 7 = 12a -|- 2. The sequence S consists of stages, with each stage, except 
possibly the last one, having log n +1 steps. Note that the number of stages is 
[y/clog^ n/(logn -I- 1)] > (Saklogn + 1. The transmission set at the jth step of 
stage s, that is Sr for r = s(log n -I- 1) -I- j, is Sj^s mod o„ • 

Among the active nodes we distinguish two types of nodes: frontier nodes, 
which still have inactive out-neighbors, and inner nodes, which don’t. If there 
is a time step t < yfclog^ n when there are k or more frontier nodes, then the 
lemma holds trivially. So from now on we assume that at each step the number 
of frontier nodes is less than k. 
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We define a sequence of stages sq = 0, si, . . . , s;+i = 6afclogn, where 1 < 
Sc+i — Sc ^ 2o;A:logn for all c. Denote by ic and fc the number of inner and 
frontier nodes when stage Sc is about to start. We will choose si, . . . , s; so that 
the following invariant holds for each c < 1: 



2*c + /c > 



Sc 

2a log n ' 



( 1 ) 



Given (1), we can prove the lemma as follows. The number of nodes that have 
received the message when stage s/+i ends is at least 



b + // > 5(2*/ + fi) 

> 

4a log n 

^ — 2ak log n 

~ 4a log n 

= k. 

So it is sufficient to construct si,...,s/ that satisfy (1). We define these 
stages inductively. For c = 0 we have Sq = Oj *0 = 0 and /o = 1, so (1) holds. 
Suppose we have determined some Sc- If Sc > 4aklogn, set I = c and we are 
done. Otherwise, we proceed as follows. 

Let F be the set of frontier nodes at the beginning of stage Sc, and let g be 
such that 2®“^ < |F| < 2®. For each j = 1, . . . ,g, let Yj be the set of nodes that 
received the message in stages Sc> Sc + 1) • ■ • ) Sc + — 1 (but were inactive when 

stage Sc started). 

We have two sub-cases: 

Case 1 : There is j for which \Yj\ > 2K In this case, take Sc+i = Sc + Oj- At least 
\Yj\ new nodes received the message, so 



2*c-i-i + fc+i > 2ic + fc+ \Yj\ 

> — — -k 2® 

2a log n 

^ 5c-|-l 

~ 2a log n 

Case 2 : For each j we have \Yj\ < 2®. We show that in this case all nodes in F 
will become inner after Og stages. 

Fix any node v that is inactive when stage Sc starts, and whose set X of in- 
neighbors in F is not empty. Pick j such that 2®“^ < |Jf| < 2®. Since \Yj\ < 2®, 
by property (*), family Sj contains a set Sj^i that hits X and avoids Yj. This 
Sjj will occur in one of the stages Sc, Sc + t, ■ ■ ■ , Sc + aj — 1. 

All in-neighbors of v are either in X or are inactive when stage Sc starts. 
When we use Sjj for transmission then: 

(i) exactly one in-neighbor of u in AT will transmit because \Sjj fl AT| = 1, 

(ii) the nodes from Yj will not interfere because Sjj fl Y,- = 0, 
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(iii) the nodes that were inactive at the beginning of stage Sc and are not in Yj 
remain inactive when Sj^i is issued, so they will not transmit. 

Therefore v will receive the message when Sj^i is issued (unless it has already 
received it earlier). Since v was an arbitrary inactive out-neighbor of F, we 
conclude that all nodes in F will become inner after ag stages. 

Take Sc+i = Sc + o,g- In this case ic+i > *c + /c and fc > 2®“^, so 

2ic-i-i + fc+i > 2zc + fcF fc 

> + 23-1 

2a log n 

^ Sc+l 

~ 2a log n 

We thus proved that (1) holds for Sc+i. Further, since |F| < k, we have Og = 
a2®logn < 2ak\ogn, and thus in both sub-cases we have Sc+i — Sc < 2a/clogn. 
The proof of the lemma is now complete. 

4 Distributed Coupon Collection 

In each phase of our algorithm we will attempt to distribute each message niy 
to some number of nodes, by performing a sequence of limited broadcasts. We 
need to achieve two goals. To obtain a good running time, the number of limited 
broadcasts must be small. Further, each message rriy should participate in at 
least one limited broadcast, that is, must be in at least one M„, for some u 
that initiates a limited broadcast. 

To choose the nodes v for which we initiate a limited broadcast, we use 
randomization. The principle behind the random process we use is similar to 
that in the coupon collector’s problem. There are two differences though. First, 
each coupon may have several copies. Second, since we do not have enough time 
to coordinate the choices, we cannot guarantee that exactly one node will initiate 
broadcasting. 

We think of V as a set of n bins and M as a set of n coupons. Each coupon 
has at least k copies, each copy belonging to a different bin. My is the set of 
coupons in bin v. Consider the following process: At each step, we open bins at 
random, by choosing each bin, independently, with probability 1/n. If exactly 
one bin, say f , is opened, all coupons from are collected. If no bin is opened, 
or if two or more bins are opened, a failure occurs and no coupons are collected. 
How many steps do we need so that with high probability (a copy of) each 
coupon is collected? 

The distributed coupon collection can be written in pseudocode as follows. 
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Procedure DistCouponColl{s). 

repeat s times 

for each bin v do 

with probability 1/n do open v 
else close v 

if exactly one bin v is opened then 
collect all coupons from 



Lemma 2. Assume we have n bins and n coupons, and that each coupon has at 
least k copies, each copy belonging to a different bin. Let S be a given constant, 
0 < i5 < 1, and s = (4n/A:) ln(n/5). Then, after performing DistCouponColl{s), 
with probability at least 1 — all coupons will be collected. 



Proof. The lemma is trivially true for n = 1, so we can assume that n > 2. Let 
Xm,j be the event that coupon m is collected at a given step j. Then Pr[Xm,j ] 
is the probability that one bin containing m is opened and all other bins are 
closed. 

For all m and j we have 



Pr[T„.,] > - ( 1 

Jii 

n 

A 

“ 4n’ 



where the last inequality follows from the fact that the sequence (1 — 1/n)" is 
monotonely increasing. The probability that some m is not collected in s steps 
is 



Pr 



y m^M A 









< 




s 



< A 



by the definition of s. 

5 The Gossiping Algorithm 

We now present our Monte Carlo algorithm for gossiping. Each node v performs 
its version of the algorithm. 
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Algorithm RANDGossiP^(e). 

5^e/\ogn 

for i = 0, 1, . . . ,logn — 1 do 
Phasei : 

Si ^ (4n/2*) ln(n/5) 

repeat Si times 

for each node v do (in parallel) 

with probability 1/n do activey-^ true 
else activcy ^ false 
LtdBroadcast„ (2*+^ ) 



Theorem 1. Let e, 0 < e < 1, be a given constant. With probability at least 1 — e, 
Algorithm RANDGossip(e) completes gossiping in time 0(nlog^ nlog(n/e)). 

Proof. In phase i, the call to LtdBroadcast„(2*+^) costs 0(2* log^ n), so phase 
i costs 0(sj2*log^n) = 0(n log^ nlog(n/<5)). Since we have logn phases and 
6 = e/logn, this implies the bound on the running time of the algorithm. 

Initially, when phase 0 starts, each m„ is in one set My, namely in M„. The 
algorithm attempts to maintain the invariant that after phase i each is in 
at least 2*+^ sets My. If this invariant is preserved at each phase, the gossiping 
will complete successfully, since after phase log n — 1 each will be in n sets 
My. Thus it is sufficient to prove that the probability that the invariant fails in 
some phase is at most e. 

The process of distributing messages in a given phase i is equivalent to the 
distributed coupon collection problem described in the previous section, where 
we view each node as a bin, and active nodes correspond to open bins. Thus, by 
Lemma 2, the probability that the invariant fails in this phase, assuming that it 
has not failed in any previous phase, is at most S. So, overall, the probability of 
failure in some phase is at most log n ■ S = e. 

To obtain a Las Vegas algorithm, run RANDGossip(e) with e = 1/n. After 
the algorithm halts, each node that has not received n different initial messages, 
announces that a failure occurred. Since all these nodes send out the same mes- 
sage, this can be achieved with just one broadcast. If B is the running time of 
the broadcasting algorithm, after B steps each node knows whether the gossip- 
ing was successful or not. If the gossiping failed, we run a naive, deterministic 
RoundRobin algorithm that consists of n rounds, with each node transmit- 
ting once in each round. This will achieve gossiping in time 0{n^). Overall, the 
expected running time will be 0((1 — l/n)nlog^ n -I- (l/n)n^) = 0{nlog^ n). 
Concluding, we get the following theorem. 

Theorem 2. There exists a randomized Las Vegas algorithm for gossiping with 
running time O(nlog^n). 
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6 Final Comments 

Several open problems remain. The only known lower bounds for gossiping are 
those for broadcasting [1,5,7], a seemingly easier problem. The gap between 
lower and upper bounds is particularly wide in the deterministic case: between 
l7(nlogn) and log^ n). Closing or at least reducing this gap is an inter- 

esting open problem. 

Our algorithm is probably not optimal. One possible research direction is to 
investigate whether one can improve the running time of our algorithm by using 
the randomized broadcasting algorithm from [3] to perform limited broadcast. 
It is not quite clear whether the algorithm from [3] can be modified to satisfy 
Lemma 1 . Further, the analysis of this modified algorithm will probably be much 
more complicated. 

Improving the bounds on gossiping or broadcasting may resolve the question 
whether gossiping is harder than broadcasting in the radio network model (both 
in the randomized and the deterministic case). Our result implies that, at least 
in the randomized case, the difference is at most a poly-logarithmic factor. 
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Abstract. Grover’s quantum se arch algorithm finds one of t solutions 
in N candidates by using (-k / 4)^J N /t basic steps. It is, however, neces- 
sary to know the number t of solutions in advance for using the Grover’s 
algorithm directly. On the other hand, Boyer etal proposed a rando mized 
application of Grover’s algorithm, which runs, on average, in 0{y/N/t) 
basic steps (more precisely, (9/4)y^ N/t steps) without knowing t in ad- 
vance. Here we show a simple (almost trivial) deterministic application 
of Grover’s algorithm also works and finds a solution in 0{y/WJt) basic 
steps (more precisely, (Stt/S) y^iV/t steps) on average. 



1 Introduction 

Grover [Gro96, Gro97] proposed a quantum algorithm — Grover’s search algo- 
rithm — that solves the following general search problem much faster than any 
randomized/deterministic algorithm designed on classical computers. 

Search Problem 

Given: For any n > 0, an oracle Boolean function / on the set 
{0, 1}” of binary sequences of length n. 

Question: Find some sequence x G {0, 1}" such that f{x) = 1. 

Remark. In general, a solution, i.e., a sequence x G {0, 1}” satisfying f(x) = 1, 
is not unique, in which case it is sufficient to output any one of such solutions. 

For any given n, since there are iV = 2" binary strings in {0, 1}”, it is (almost 
obvious) that any algorithm solving the above problem needs N steps to find 
the desired sequence. Surprisingly, though, Grover’s search algorithm finds the 
desired sequence in 0{V~N) quantum computation steps, where each quantum 
step (which we refer G-steps) can be implemented by some poly(n) number of 
basic quantum gates. More precisely, when there are t solutions, i.e., t binary 
sequences satisfying /, Grover’s algorithm finds some of them in 0{^jN/t) G- 
steps. Note, however, that one needs to know the number t in advance in order 
to achieve this better bound; but t, the number of solutions, is usually unknown 
in advance. For solving this problem, Boyer etal [BBHT96] proposed an algo- 
rithm that applies Grover’s algorithm with randomly chosen parameters, that 
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is, a randomized application of Grover’s algorithm, which runs, on average, in 
0{^/Nji) G-steps without knowing t in advance. Here we show that a simple (in 
fact, almost trivial) deterministic application of Grover’s algorithm also works 
and finds a solution in 0(y/ N/t) G-steps on average without knowing t in ad- 
vance. (Notice also here that any classical computation including randominzed 
one can be simulated by some quantum computation. In fact, for any classical 
algorithm using any quantum algorithm(s), one can design a single quantum 
Turing machine that executes the whole algorithm. In this sense, there is no 
merit of removing randomness from the application algorithm. Yet, the authors 
think that the simpler algorithm is the better.) 

Grover’s Algorithm and Its Randomized Application 

Let us discuss our problem specifically. We start with recalling Grover’s al- 
gorithm and some basic facts about the algorithm. Notions and notations for 
quantum computation we use here are standard; see, e.g., [BV97, Gru99, Hos99, 
ForOO]. 

Gonsider any n > 0 and any oracle function / on {0, 1}" for specifying the 
above general search problem, and let us fix them in the following discussion. 
That is, our task is to find a sequence x G {0, 1}” that satisfies f{x) = 1. A 
sequence x £ {0, 1}” satisfying f(x) = 1 is simply called a solution. Let N = 2”, 
i.e., the total number of sequences in {0, 1}", and let t denote the number of all 
solutions among N candidates. 

In Grover’s algorithm, each sequence x G {0, 1}" corresponds to a quantum 
base state |cc) consisting of n qubits. (In the following, we identify an n qubit 
base state with the corresponding n bit binary sequence.) The main ingredients 
of Grover’s algorithm are the following three unitary transformations on n qubit 
states. 

Walsh-Hadmard transformation : 

(Here x ■ y denotes the bit-wise inner product.) 

Sign flipping on /: 

Sf. |x) (-!)■''("’) |a;) 

Sign flipping on 0: 

So : |0) i-G- — |0) and |a;) i-G- \x) (if x yf 0”) 

By using these transformations, one G-step is defined as the following unitary 
transformation U . 



U = -WSoWSf 

That is, one G-step is to apply this C/ to a current state. Grover’s algorithm is 
to apply U for some appropriate number of times to the following initial state 
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ae{0,i}" ^ 

Formally, by the j G-step execution of Grover’s algorithm (or more simply G{j)) 
we mean to apply UUU ■ ■ - U to </>o and then observe the obtained state. (We 

^ -V- 

j 

assume that the observation is made so that some n qubit base state (i.e., n bit 
sequence) is observed with the probability that is the square of its amplitude.) 

For justifying this procedure, the following property of U plays a key role 
[Gro96, Gro97]. 

Lemma 1. Gonsider the quantum state obtained by applying U to </>o for j 
times. Then each solution state, i.e., a base state corresponding to a solution, 
has the same amplitude (which we denote Oj) while the other base state also has 
the same amplitude (which we denote bj). Furthermore, by using 0 that satisfies 
sin 9 = ^tjN , these amplitudes are stated as follows. 



Qj = ^sin((2j + 1)6»), 
bj = cos((2j + 1)0). 

The angle (2j + 1)0 determines the amplitude aj. In the following we call this 
angle the angle {of a solution) after applying U for j times {to the initial state) 
(or more simply, the angle after executing G{j)). Note that after applying U for 
[(7 t/ 4)0J times, the angle gets close to 7 t/ 2; hence, the amplitude of each solution 
is close to 1/ \/t, which means that the total probability that solution states are 
observed is close to 1. Note also that (7 t/ 4)0 is approximately {tt / 4) ^/NJt by 
using the approximation 0 « sin0 (= ^^t/N). This argument leads us to the 
following theorem of Grover [Gro96, Gro97]. 

Theorem 1. Define mg by 




Then a state observed by the execution of G{mo) is one of the solutions with 
probability approximately 1 — 1/N (k. 1). 

Therefore, if we can compute mo, we would simply execute G{mo) to get some 
solution. It is, however, not so easy to compute it because t is usually unknown in 
advance. Note that we cannot simply execute G{m) with some m > mo; in this 
case, the probability that some solution is observed could become much smaller. 
In order to solve this problem, Boyer etal [BBHT96] proposed the following 
randomized algorithm. 

It is clear that this algorithm finds a solution. On the other hand, we can 
show the following time bound. 
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Algorithm Randomized-Grover 

A ^ 6/5; i ^ 0; 

for i 1 to 00 do 

m A*; 

select j,0<j<m — 1, uniformly at random; 
execute G{j) and let x be the observed state; 
if f{x) = 1 then output x and halt; 

end-for; 

Fig. 1. A randomized application of Grover’s algorithm 



Theorem 2. The average number of G-steps executed in the above algorithm is 
at most {9/4)y/N/t. 

Remark. In the proof of this theorem and in the following discussion in gen- 
eral, we often assume that t/N is small enough, say, t/N < 1/100 and use some 
approximations like \/t/N « sin \/t/N under this assumption. Thus, our bounds 
may not be precise; but even so, it is easy to see that the bounds shown here cer- 
tainly hold by changing constant factors. We leave the tedious precise calculation 
to the interest reader. Also though our arguments need an assumption such as 
t < A/100, this assumption is not essential for designing a fast search algorithm. 
In fact, ift> A/100, then we can search for a solution by simply picking any n 
bit sequence randomly, say, 1,000 times. 

2 Deterministic Application of Grover’s Algorithm 

Here we show that a simple and deterministic execution of Grover’s algorithm 
still yields a similar result. More specifically, we consider the algorithm given in 
Figure 2. 



Algorithm Deterministic-Grover 

% The parameter k is fixed to 2. 

for i 1 to 00 do 

m ^ 2h 

execute G{m) for k times and let xi, ..., Xk be the observed states; 
if f{xu) = 1 for some u, 1 < u < k then output Xu and halt; 

end-for; 

Fig. 2. A determinisitic application of Grover’s algorithm 



Again for this algorithm, we can show that its average running time is 
0{y^N/t). In the following we will prove this fact after preparing some tools. 

First let us see that at some point of the for-iteration of the algorithm, the 
angle after executing G{m) in this iteration gets reasonably close to n/2. 
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Claim. There exists some integer to > 0 such that 

7T /IT 

- < (2-2*0 + 1)0 < 

Furthermore, since 6 « yJt/N , we have 2*“ < {Tr/3)y^N/t. 

Proof. Clearly, (2 • 2* + 1)0 (« (2 • 2* + l)^/t/N) > irfi for some z > 0. Let i be 
the largest integer such that (2 • 2* + 1)0 < tt/ 3 holds. Then we have 

O-rr 

(2-2*+i + l)0 < 2(2 -2* + 1)0 < — . 

3 

Therefore, the claim holds with ig = i + 1. 

Below we will keep using to to denote the one satisfying this claim. Also let 
W denote the above range of angles; that is, W = [tt/3, 27t/3]. As stated in the 
above claim, the angle at the toth for-iteration gets into W ; then the probability 
that a solution is found at this for-iteration is, as we will see below, more than 
1 — (1/4)^. In general, for any i, let us simply call the angle (2 • 2* + 1)0 the angle 
at the ith for-iteration. Recall that the success probability is defined by sin^ of 
angles; hence, we had better discuss angles with their residues under modulo tt. 
Thus, in the following, by “an angle” we mean the residue of a real angle under 
modulo TT. For any for-iteration, we have the following relation between the angle 
and the failure probability at this for-iteration. 

Claim. For any z > 0, let w be the angle at the zth for-iteration. Let a = \tt /2—ui\ 
and 5 = sin^ a. Then the probability that no solution is found at this for-iteration 
is 5^ . (Note that if w G W , then we have <5 < 1/4.) 

Proof. Since the angle at the zth for-iteration is ui, the probability that some 
solution is found by one execution of G{m) (where m = 2*) is 

/ 1 , V .2 

t- ^sinw = sm u). 

Wi J 

Hence, the probability that no solution is found at this iteration is 

(1 — sin^w)* = (cos^w)^ = (sin^ Itt/ 2 — w|)^ = (sin^a)^ = S^. 

It would be nice if we could argue that the angle belongs to W reasonably 
often. Unfortunately, however, if the angle (at some for-iteration) gets very close 
to 7 t/ 2, then it takes rather long time to have an angle in W again. Note, on 
the other hand, that if the angle is close to 7 t/ 2, then the probability of finding 
a solution at this iteration is close to 1. The crucial point of our analysis is to 
estimate this trade-off. 

For simplifying our discussion, instead of an angle uj, we will argue by using 
a = \tt / 2 — Lo\, which we call a co-angle. For example, the above claim shows 
that if the co-angle (at some for-iteration) gets less than 7 t/ 6, then the error 
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probability at this iteration becomes less than (1/4)''. Let A denote this good 
co-angle range; that is, A = {a : 0 < a < 7 t/6}. 

For our analysis, we estimate the number of for-iterations until having a 
co-angle in A again. 

Claim. Consider any ith for-iteration with the co-angle a £ A. (We assume that 
a > 0.) Define h = log|’7r/(3a)] . Then there exists some h, 1 < h < h, such that 
the co-angle at the {i + /i)th for-iteration gets into A again. 

Remark. For simplifying our discussion, the argument here is a bit unprecise. 
More specifically, it may be the case that the co-angle at the (f-|-/i)th for-iteration 
is slightly larger than 7 t/ 3; but the difference is at most -1-30, which should be 
small enough under our assumption on t/N. 

Proof. Here we show that the angle at the {i + ft.)th for-iteration belongs to the 
range W = [tt/3, 27t/3]. 

Since the claim clearly holds if a = 7 t/ 6, we may assume that a < tt/ 6, which 
implies that h >2. Recall that the definitions of angle and co-angle; that is, we 
have {(jj =) 7t/2 — a = (2 • 2* -|- 1)0, where uj is the angle at the ith for-iteration. 
(Here we consider only the case that ut < tt/ 2. The case that w > tt/ 2 can be 
treated in a similar way.) Then for any h' , 2 < h' < h, we can estimate the angle 
w' at the (i -I- /i')th for-iteration as follows. 

uj' = {2- 2*+'^' -b 1)0 (mod 7 t) 

= 2^' ■ ((2 • 2* -b 1)0) - {2^' - 1)0 (mod tt) 

= 2'^' ■(!-«)- (2^' - l)e (mod tt) 

= —{2^ a -b {2^ — 1)0 — rmr) (mod tt). 

Here m is some nonnegative integer such that 0 < 2^^ a -b {2^ — 1)0 — tott < tt. 
Since sinw' = sin(7r — u'), we can discuss by using Tv' = 2^^ a + {2^ — 1)0 — tott 
instead of the above oj'. 

Now we consider the first h' < h such that 2'* a -b (2'* — 1)0 > tt/3. Such h' 
exists because we have 2^ a > tt/3 from our choice of h. Furthermore, we may 
assume that h' > 2, because in the special case that h' = 1, we can easily show 
that h = 2 satisfies the claim. Also notice that uJ' = 2'' a + {2^ — 1)0; that is, 
TO = 0 for this h' . 

Clearly, we are done if oj' < 2tt/3. But if otherwise, i.e., if 2'* a -b {2^ — 1)0 
> 27t/3, then the angle U" = 2’''~^a + {2'^'-\- 1)0 (« (2'*'a -b {2^' - l)0)/2) 
should be > tt/3. On the other hand, since 2'* a -b {2^ — 1)0 < tt , we have oj" 
= 2'* ~^a + {2^ — 1)0 < tt/2 (< 27t/3). Therefore, the claim is satisfied with 

either h = h' or h = h' — 1. 

Next we divide the co-angle range A. Below let j be any nonnegative integer. 
Let Sj = 2“^^+^), and let Sj be the co-angle < tt/2 such that Sj = sin^ aj. (For 
example, we have <5 q = 1/4 and So = ’’‘Zb.) Then define Aj = {a : Sj+i < a < 
Sj}. Also we define hj as follows. 
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hj — 



log 



3a 



i+i 



Since oij gets very small, we may assume that sin o.j « o.j . In particular, we may 
assume that Sj+i > aj/2. (In fact, we would have a^+i > ajj^/2 for large j.) 
Hence, we have /ij+i < hj + \ . 

Here let us summarize our discussion. Consider any ith for-iteration, and 
suppose that the co-angle at this iteration belongs to Aj. Then it follows from 
the above claims that (i) the probability that the algorithm does not halt at this 
for-iteration is < 5^ (= and (ii) the co-angle gets into A again before 

the (i + hj)th for-iteration. 

Now we are ready to prove our theorem. 



Theorem 3. The average number of G-steps executed in Deterministic Grover 
(the algorithm of Figure 2) is {Sir / i) N / 1 . 

Proof. Recall that the co-angle belongs to A at the ioth for-iteration. We may 
assume that it is indeed the first for-iteration having a co-angle in A. In general, 
we use ii, i 2 , ..., to denote the first, the second, ... for-iteration (after the ipth 
for-iteration) where the co-angle belongs to A. Also let ao, ai, ... denote the 
co-angles at the foth, zith, ... iterations. 

To bound the expected number of executed G-steps, we may assume that 
the execution of the algorithm reaches to the ipth for-iteration. Thus, we first 
estimate the number of G-steps executed by the end of the ipth for-iteration, 
which is bounded as follows. 



k-{l + 2 + 2^ + --- + T°) = fc-(2*°+i-l) < (1) 

Next we analyze the expected number Eq of G-steps executed after the ipth 
for-iteration. (Precisely speaking, Eq that we will analyze below is the expected 
number Eq of G-steps executed after the ioth for-iteration under the condition 
that the ioth for-iteration is executed.) Although we know that the co-angle «o 
at the loth for-iteration is in A, we do not know which Aj it belongs to. But for 
each j > 0, by assuming that a G Aj, we can estimate the failure probability 
and the length to the zith for-iteration. Thus, we can bound Eq in the following 
way. (Here we use Ei to denote the amount corresponding to Eq-, that is, Ei is 
the expected number of G-steps executed after the iith for-iteration under the 
condition that the zith for-iteration is executed.) 



Eq < ^ Pr{ no solution is found at the zoth for-iteration | ao G Aj } 

j>0 

X (# of G-steps executed during ((zg -I- 1) ~ zi)th for-iteration -|- Ei) 



^ + • • • + -h El) 

3>0 
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- (2^0+hj+i _ 1 ) 4. ^ 

3>0 j>0 

j>0 

Here by using the fact that hj < + j and Hq < 2, and our choice of Sj and k 

(i.e., Sj = and k = 2), we can bound the last expression as follows. 



< fc2*o 




-)hj+l 



Eq < fc2*° • • 2'*°+^+! +Y^J - El 

\j>0 ) j>0 

= 2-^-1 + -ill = kr<^ + Y<^j ■ El. (2) 

i>o j>o j>o 

At this point, let us see how Ei is estimated. Notice that Ei depends on the 
value of ii. Otherwise, the calculation is quite similar to the one for Eq. (Here 
we use E 2 denoting the amount corresponding to Ei for 12 .) 



El < ^ Pr{ no solution is found at the iith for-iteration | a\ G Aj } 

3>0 

X (# of G-steps executed during ((ti + 1) ~ i 2 )th for-iteration -|- E 2 ) 
^ + 2*'+^ -b • • • -b 2*1+'*^) -b E 2 ) 

3>0 

< kT^ ■ I X I + X ■ ^2 < fc2*i + X • ^2- (3) 

\j>o J j>0 j>0 

We substitute Ei of (2) by (3). Here notice that ii depends on the choice of oq; 
in fact, if oq G Aj, then it is bounded by io + hj (< io + ^0 + J < *0 + j + 2). 
Thus, we have the following bound. 



Eo < kr« + X I *2*1 + X X • -^2 



Ji>0 



J2>0 



< fc2*« -b Y • /c2*°+2i+ 2 -b ^ ^ • E 2 

ji>o ji>o y2>o 

= fc2*« -b kT° Y + X X • ( X X ■ -*^2 

ji>o ii>o \i2>o 



= *2*“ + Xa2*o+^5^. I ^5^,.A2 

ji>o \i 2 >o 
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We can similarly expand i? 2 , E^, ■■■, thereby deriving the following bound. 



2kn / N 



Eo < k2^° +2~^ -k2^o +2~Ek2^o = 2fc2*° < —\l—- (4) 



Now the bound of the theorem is immediate from (1) and (4), and our choice 
of k. 
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1 Introduction 

MAX SAT is one of famous combinatorial optimization problems stated as fol- 
lows: given a multiset of clauses, find an assignment that maximizes the number 
of satisfied clauses (that is equivalent to finding an assignment that minimizes 
the number of unsatisfied clauses). MAX 3SAT is restricted version of MAX 
SAT, that is, its input is restricted to a multiset of 3-clauses, i.e., each clause 
contains exactly 3 literals whose underlying variables are distinct each other. 
Since these problems are not only NP-hard problem, but MAX SNP-complete 
problem, there is no polynomial time approximation algorithm whose approxi- 
mation ratio is close to 1 unless P = NP. Furthermore, Hastad showed that it is 
NP-hard to approximate MAX 3SAT within 8/7 — £ for any £ > 0 [5] . In spite 
of these negative results, many polynomial time approximation algorithms with 
proven approximation ratio for MAX SAT have been proposed [3,4,7]. 

To evaluate these approximation and exact algorithms experimentally, test 
instances are necessary. While there are some benchmark problems for MAX 
SAT, we still do not have enough number of problems, and we would like to 
have sure way generating nontrivial test instance systematically. In particular, 
it would be nice if we have an efficient test instance generators for MAX SAT. 

Test instance generator have been studied for NP problems (see, e.g. [1, 2, 6, 
8]). On the other hand, for optimization problems like MAX SAT, no generator, 
as far as the author knows, has been proposed in the literature. 

When one evaluates an approximation algorithm, test instances should have 
their optimal solution otherwise we can not determine how good approximated 
solution are obtained. In spite of necessity of optimal solution, it seems hard to 
check whether given a (candidate of) optimal solution is really optimal. In fact, 
in the case of MAX SAT, it is co-NP complete to check the optimality of a given 
solution. 

Since the hardness of generating a test instance with its optimal solution 
seems unavoidable, we consider two alternative approaches. The first one is to 
estimate (analytically and experimentally) the expected number E[s{F)] of op- 
timally satisfied clauses of F, when MAX SAT instance F is generated under 
a certain distribution. Suppose that we can estimate A[s(F’)] with reasonable 
precision. Then even if we can not supply optimal solutions, we could still test 
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the average performance of a given MAX SAT algorithm. That is to say, we 
just run it on generated instances and compare the average number of clauses 
satisfied by obtained solution with if[s(F)]. The second approach is to design 
a test instance generator that provides both an instance and its candidate of 
optimal solution that is really optimal with high probability. 

In this paper, we obtain some basic results for these two approaches. For the 
first one, we analyze the minimum number u{F) of unsatisfiable clauses of given 
formula F. We obtain two lower bounds on its expectation E[u{F)], I7(m/lnm) 
and m/8 — 0{^/mn), for random 3CNF formula with m clauses over n variables. 
We also analyze the expectation experimentally. For the second approach, on 
the other hand, we propose a simple random instance generator that, for given 
u, generates an instance and an assignment (i.e., a candidate for the optimal 
solution) that leaves just u clauses unsatisfied. Furthermore, we prove that the 
generated solution is indeed optimal with high probability if the number of 
clauses is I7(nlnn) + f2{nVu In n). 

Hereafter, we describe basic notations. Let V = {ui, . . . , u„} be propositional 
variables. We represent true and false by 1 and 0 respectively. An assignment t 
is a mapping from variables to Boolean values, ui, . . . , — >■ {0, 1}”. We denote 

by Td a set of assignments whose Hamming distance from 1" is d. A literal is 
a propositional variable v or its negation v. We use a complement-free set of 
exactly 3 literals to denote a 3-clause, a disjunction of exactly 3 literals. Let 
3CNF formula F be a multiset of 3-clauses, and we denote |F| by m. We say F 
is a random 3CNF formula with m clauses over n variables if each clause in F is 
chosen with replacement from possible 8(g) clauses over n variables uniformly 
at random. For any F and t, we denote the number of unsatisfied clauses in F 
at an assignment t by u{F,t). Furthermore, u{F) is the minimum number of 
unsatisfied clauses in F for all assignments, that is, u{F) = min^g^o,!}" u{F,t). 
We say u{F) is the optimal value of F. 

In this abstract, we omitted proofs and some explanations. See research re- 
port in http://www.is.titech.ac.jp/. 



2 The Average Optimal Value for Random 3CNF Formula 

We consider the expected number E[u{F)] of unsatisfiable clauses on the optimal 
solution. There exists an obvious upper bound m/8 since it is the upper bound 
of u{F). Here we prove two lower bounds of E[u{F)]. 

Theorem 1. For any m > nln2/ln(8/7) « 5.19n, the expected number of un- 
satisfiable clauses on optimal assignment satisfies the following bound, 

mln8/7 — nln2 -I- ln(m/7 — 1) , , , 

, — (1 - 7/m). 

In m/7 

The sketch of the proof is as follows. For any assignment t, the probability 
Pr[M(F, t) = v] that the number of unsatisfiable clauses in F on t is exactly 
V is estimated as (™) By using this estimation, we can bound 



E[u{F)] > 
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(a) the average of u{F) (b) the ratio to m of difference between the experimen- 

tal and the theoretical value 



Fig. 1. Experimental results 



the probability that the number of unsatisfiable clauses in F on its optimal 
assignment is larger than v. Finally, we use Markov’s inequality. 

In the Theorem 1, rounding by in'" causes the denominator of upper 
bound, In m/7. Thus we need to estimate the probability that u{F) is larger than 
V more tightly. The probability Pr[M(F') > u] includes partial sum of binomial 
distribution. For sufficiently large m, we can approximate binomial distribution 
to normal distribution by using central limit theorem. This observation leads us 
to the next result. 

Theorem 2. For any m > 7nln2 + -\/7n In 2 — 16\/7n In 2 — 8 « 9.7n — 8, the 
expected number E[u{F)] of optimally unsatisfiable clauses satisfies following 
bound, 

E[u{F)] > ^ fm — V 14mn In 2 + 8 ) ( 1 = ) . 

8 V 7 y V 77mm In 2/ 

We compare theoretical lower bounds with the average number of unsatisfi- 
able clauses on optimal assignment that we obtained through some experiments. 

Now we describe the setup used in the experiments. We fixed the number of 
variables to 50, and we selected the numbers of clauses m from 250 to 1000 at 
intervals of 50 (we skipped 900). For each m, we checked the average optimal 
value to 100 (50, for over 600 clauses) random 3CNF. 

We show results in Fig. 1(a) , along the horizontal axis we give the number of 
clauses m, and along the vertical axis we give the average number of unsatisfiable 
clauses on optimal. Big black dots are results of experiments, and dotted line and 
solid line are lower bounds of Theorem 1 and Theorem 2 resp. Furthermore, white 
circle are values obtained by calculating the partial sum of binomial distribution, 
argmax.„ ~ ■ With this graph, it seems that 

behavior of experimental results is similar to that of lower bound of Theorem 2. 

In Fig. 1(b), we show the ratio of difference between the experimental and 
the theoretical value to m. Black (white, resp.) dots are the ratio of the difference 
between the experimental value and the lower bound of Theorem 2 (the value 
that obtained by calculating the binomial distribution, resp.). We can see that 
the difference is as much as 0.02m. Thus we assume that the relation of E[u{F)] 
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with m forms (1/8 — ci)m — ^c^+cs, and get (1/8 — 0.00620)m — \/7.13m+ 10.9 
(dashed line in Fig. 1(a)) by the least squares. 

3 Random Instance Generation for MAX 3SAT 

Our final goal is to generate MAX SAT instance with its optimal solution at ran- 
dom. When evaluating MAX SAT solver experimentally, the optimal solution is 
necessary since it is hard to determine whether obtained solutions are correct. 
On the contrary, it is also hard to check a (candidate of) optimal solution gen- 
erated with an instance. Hence, we relax the condition of instance generator 
such that the candidate of optimal solution is correct with high probability, that 
is, for given n and u, our generator generates appropriate size MAX SAT in- 
stance over n variables and an assignment (for candidate of optimal solution) 
that unsatisfies exactly u clauses. 

Our basic idea is as follows. Let F and t be the output instance and assign- 
ment of our instance generator for given n and u. We assume that 1” is chosen 
as t w.l.o.g. Of course, F shall have exactly u clauses that are not satisfied by t. 
These clauses shall be the form of {vf, T/, uF}. There exist some assignments in 
Ti that satisfy some of these u clauses while the number of unsatisfied clauses 
by 1" is exactly u. Therefore, for each of these u clauses, we add three more 
clauses forms of {vi,Uj^,Vi^}, {vj,i^,Vj^} and {vk,W^,vj^}, respectively. Now 
each assignment in T\ has exactly u unsatisfied clauses. Then we add appropri- 
ate number of clauses that satisfied by t. Intuitively, t becomes one of optimal 
solutions of F with high probability when enough large number of clauses are 
added. 

Now we introduce our instance generation algorithm formally. First, we define 
some sets of clauses Cg-, Cg+, C|, Cg- and C 3 + as follows: 

Cg~ — {{uii , Vi2 , ^ 23 } , Vi4 . , , Ujg , U 27 } , {Uji , Uig , Ujg}}, 

^9-t- ^i2 j 5 ) ^z4 ; } 5 {^z2 5 ^z6 7 ^z?} j ^z8 5 ^zq} 1 ? 

Q = vE} , {W, va, Ui4}}, 

Cz- = hill}, and C 3 + = {vF^va^Viz} . 

Hereby we show a pseudo code of our algorithm in Fig. 2. In this algorithm, 
u many sets in the form of Cg_ (Cg+, resp.) guarantee that an assignment 1" 
(0”, resp.) does not satisfy u clauses exactly and each assignment in Ti (T„_i, 
resp.) does not satisfy at least u clauses. On the other hand, there will exist some 
assignments that does not satisfy less than u clauses. In such case, t is no longer 
the optimal solution of F. To avoid these assignments, we add r clauses satisfied 
by both of 1” and 0” (Cz- and C3+). We also add b many sets of to 

balance the number of occurrence of each literal. Intuitively, the larger r makes 
the higher probability that t is the optimal solution of F. Hence, we analyze the 
condition of r such that t is the optimal solution of generated instance. 

Let random variable Xt be the number of clause sets forms of Cg_, Cg+, 
Cl, Cz-, and Cz+ that contain at least one clause not satisfied t. It is clear that 
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input: the number of variables n, the desirable optimal value u, 

the number of clause sets UY=iCl, b, the number of random clauses r. 

begin 

let F = 0; 
repeat u times do 

choose 9 distinct variables Vri, ■ ■ ■ , Vr 9 and add C 9 _(wri, • • • , Ur-g) to F; 
choose 9 distinct variables Vri, ■ ■ ■ , Vr 9 and add C 9 +{vri, ■ ■ ■ , Vrg) to F; 
repeat b times do 
for i = 1 to n do 

choose 4 distinct variables Vri, • • • , Vr4 £ \ {vi} and add Cg{vri, ■ ■ ■ , Vri) to F-, 

repeat r times do 

choose 3 distinct variables Wri,Ur 2 ,Wr 3 and add C 3 _(Wri, Wr 2 , fi-s) to 
choose 3 distinct variables Wri,Ur 2 ,Ur 3 and add C 3 +(uri, Ur 2 , Urs) to F\ 
choose a truth assignment t £ {0, 1}" at random; 
flip literals whose under lying variables are assigned 0 on t; 
output F and t; 
end. 



Fig. 2. Algorithm. 



u{F, t) is at least u for any t in Tq (= {!"}), Ti, T„_i and T„ (= {0”}). Thus our 
goal is to show the condition that Xt is at least u for any t in Td {2 < d < n — 2 ) 
with high probability (we remark that Xt is less than u{F,t)). 

First we consider fid the expectation of Xt on arbitrary assignments t in Td- 

Lemma 1. For any d (0 < d < n), for any assignment t G Td, let fid he the 
expected number of clause sets Cg-, Cg+, C 5 , C 3 - and C 3 + in F such that each 
set contains at least one clause not satisfied by t (the expectation of Xt). Then 



fid>u-u 



d{n 

n{n 



d) . din — d) d(n 

1) n — 1 n[n 



1 )' 



By using this lemma, we can use Chernoff bound to bound the probability 
that Xt is less than u and prove following theorem. 

Theorem 3. For any positive e < 1, the probability that t is optimal solution 
of F is at least 1 — e if 

r > u — bn + 4(n — 1) ln2n — 2(n — 1) Ine + (n — l)\/2]n2(ri^^hieV^. 



In theoretical analysis, we prove that our instance generator outputs instance 
with optimal value u with high probability if r is w + 0{nlnn) + 0{nVuln n). 
But is such a large number really required? In fact, this upper bound becomes 
over 1300 even if u = 5, n = 50 and e = I. Thus we compare this theoretical 
bound with r obtained through experiments. Each experiment, we calculate the 
ratio of instances whose optimal value is u for 100 instances. 

First, we estimate whether our algorithm can cover the average optimal value. 
We fix the number of variables n to 50 and b to 0. We select u from 5, 10, 15 and 
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(a) the ratio of instances changing op- 
timal value u 



(b) the number of clauses and optimal 
value u (ratio is 0.5) 



Fig. 3. Changing the value to be optimal u (n = 50, b = 0) 




(a) the ratio of instances 
whose optimal value is 5 



(b) r for each n where 50% in- 
stances have optimal value 5 



Fig. 4. Changing the number of variables n (m = 5, 

& = 0) 




Fig. 5. Changing h. 
(n = 50, u = 5) 



20. Results are in Fig. 3(a). The intervals of results are smaller as opposed to u. 
For each u, we pick up two values of m = Su + 2r, the number of total clauses, 
such that one is the largest m where the ratio is less than 50% and the other 
is the smallest m where the ratio is larger than 50%. We compare this result 
with the average optimal value obtained experimentally in Section 2 (Fig. 3(b)). 
From the figure, we can see that the number of clauses that the ratio of instance 
with optimal value u is just 50% is near the the number of clauses whose average 
optimal value is u. Thus, we consider that our algorithm generates an instance 
whose optimal solution is given with high probability if the optimal value is quite 
smaller than the average optimal value. 

Second, we want to estimate whether the main term of upper bound on r is 
nlnn or not. We choose n from 40 to 100 at intervals of 10. And we fix m to 5 
and b to 0. Results are in Fig. 4(a). Then we also pickup two values of r where 
the ratio of instances with given optimal solution is just smaller and larger than 
0.5 for each n. In Fig. 4(b), the vertical axis is value of r where the ratio of 
instances with given optimal value is just stepping over 0.5, the horizontal axis 
is the number of variables n. From this result, the relation of r and n seems linear 
or the constant of nlnn is very small. Hence, we try to fit by linear function, 
and get r = 2.6n + 14.8 (dotted line). 
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Finally, we change 6 , the number of clause sets In the theoretical 

upper bound, there exists the term —bn. This means that adding b sets of 
is equivalent to adding bn random clauses of the each form C 3 - and C 3 + ( 2 bn 
clauses). We fix the number of variables n to 50 and the optimal value u to 5 
again. We choose b from 1, 2 and 3. Results are in Fig. 5, the vertical axis is the 
ratio of instances whose optimal value is 5 and the horizontal axis is the total 
number of clauses m. We can easily see that b diminishes the total number of 
clauses. Furthermore, it contribute converging the ratio to 1 much faster. 
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Abstract. We study variations of Steiner tree problem. Let P = {pi,p 2 , 
. . . ,Pn} be a set of n terminals in the Euclidean plane. For a positive 
integer k, the bottleneck Steiner tree problem {BSTP for short) is to 
find a Steiner tree with at most k Steiner points such that the length 
of the longest edges in the tree is minimized. For a positive constant 
R, the Steiner tree problem with minimum number of Steiner points 
{STP — MSP for short) asks for a Steiner tree such that each edge 
in the tree has length at most R and the number of Steiner points is 
minimized. In this paper, we give (1) a ratio- \/3 -I- e approximation al- 
gorithm for BSTP, where e is an arbitrary positive number; (2) a ratio-3 
approximation algorithm for STP-MSP with running time 0(n®); (3) a 
ratio- 1 approximation algorithm for STP-MSP. 



1 Introduction 

Given a set P of terminals in the Euclidean plane, a Steiner tree on P is an 
acyclic network interconnecting the terminals. Every vertex in a Steiner tree 
other than terminals is called a Steiner point. 

A variation of Steiner tree problem, named bottleneck Steiner tree problem 
(BSTP for short) is defined as follows: given a set P of n terminals and a positive 
integer k, we want to find a Steiner tree with at most k Steiner points such that 
the length of the longest edges in the tree is minimized. Contrary to the classic 
Steiner tree problem, degree-2 Steiner points are allowed in BSTP. Instead of 
minimize the total length of the tree, here we want to minimize the length 
of the longest edges. The problem has applications in the design of wireless 
communication networks [8]. Due to budget limit, we could put totally n + k 
stations in the plane, n of them must be located at given points. We would like 
to have the distances between stations as small as possible. The problem here is 
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to choose the locations for the other k stations to minimize the longest distance 
between stations. For other related variations and applications, see [1,2, 4, 7]. 

The problem is NP-hard. In [8], it shows that the problem cannot be approx- 
imated in polynomial time within performance ratios 2 and \/2 in the rectilinear 
plane and the Euclidean plane, respectively. Moreover, a ratio-2 approximation 
algorithm was given for both the rectilinear plane and the Euclidean plane [8]. 
For the rectilinear plane, the performance ratio is the best possible. A ratio-1.866 
approximation algorithm was given for the Euclidean plane [9]. In this paper, 
we give a randomized approximation algorithm with performance ratio -\/3 -I- e 
for the Euclidean plane, where e is an arbitrary positive number. 

Another variation of Steiner tree problem considered here is the Steiner tree 
problem with minimum number of Steiner points, denoted by STP — MSP for 
short, is defined as follow: Given a set P of n terminals in the Euclidean plane, 
and a positive constant R, STP-MSP asks for a tree T spanning a superset 
of P such that each edge in the tree has length at most R and the number 
C(T) of Steiner points is minimized. In the STP-MSP problem, Steiner points 
of degree 2 are possible. The STP-MSP problem has an important application in 
wavelength-division mnltiplexing optical network design [3, 5]. In [4], Lin and Xue 
proved that the STP-MSP problem is NP-hard and gave a ratio-5 approximation 
algorithm. In [1], Chen, Du et al. showed that the algorithm given by Lin and Xue 
[4] has performance ratio exactly 4, and they also presented a new 0(n‘^)-time 
approximation with performance ratio at most 3, where n is the number of given 
terminals. In this paper, we give a 0(n^)-time approximation with performance 
ratio at most 3, and a randomized approximation with performance ratio at 
most |. 

A full component of a Steiner tree is a subtree in which each terminal is a 
leaf and each internal node is a Steiner point. A Steiner tree for n terminals is 
a k-restricted Steiner tree if each full component spans at most k terminals. 

2 Ratio-Vs Approximation Algorithm for BSTP 

In this section, we give a ratio--\/3 approximation algorithm for BSTP in the 
Euclidean plane. 

Theorem 1 Let T be an optimum Steiner tree for BSTP. Then, there is a 3- 
restricted Steiner tree with the same number of Steiner points as T such that the 
longest edge in the tree is at most -\/3 times the optimum. 

Proof: We assume that T is rooted by arbitrarily selecting a Steiner point as 
its root. We will modify T bottom up into a 3-restricted Steiner as described in 
the theorem. Without loss of generality, we assume that T is a full Steiner tree. 

We organize the nodes in T level by level (ignoring degree-2 Steiner points) . 
Level 1 is the lowest level. Level i is the level above level i — 1. Let v be a node 
at level 3, v' a child of v. If v' is a Steiner point, we can assume that the degree 
of v' is 3. Otherwise, we can always find two children of v' , say a and b, such 
that lav'b < 120°. Without loss of generality, assume \av'\ > |6w'|, and assume 
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the path P from a to v' has m degree-2 Steiner points. Then we construct a new 
Steiner tree T' by removing all the edges on P, and directly connecting a and b 
with m degree-2 Steiner points so that the length of each edge in the segment 
ab is at most -\/3. Now, we need only to consider the tree obtained from T' by 
removing a and all the degree-2 nodes on the path connecting a and b in T' . 

In the rest, we assume that the degree of v' is at most 3. We consider two 
cases. 

Case 1. Every edge below u in T has length no more than 1. We consider 
the case where v has 4 grandchildren, and leave the case where v has 3-children 
to the interested readers. 

We first consider that w is a degree-3 node. (See Figure 1: (a).) In this 
case, we can assume that Ibv'c > 120° and Zd/e > 120°. Therefore, we have 
min{/d/ri, Ze/f} < 120° and nAn{/_cv'v^ /-bv'v} < 120°, i.e., 

min{|dr;|, \ev\} < -\/3 and min{|cz;|, |6u|} < VS. (I) 

Without loss of generality, assume that 

Idfv = 0 = min{lvv'v, Ivv'c, Idfv, lefv}, and lvv'c<lvv'b. (2) 

We will find a point h on edge vv' such that max{|c/i|, \bh\, \dh\} < -\/3, and 
construct a new tree by removing nodes v' and /, adding edges ch, cv, dh, bh, 
and connecting d and e directly with a Steiner point w on the middle of de. 
(See Figure l:(b).) Then, we can continue the modification process with n — 3 
terminals in P U {w} — {b, c, d, e}. 

By (1) and (2), we know that \ch\ < V3 for any h on the edge vv'. So, we 
need only to choose an h to guarantee that \bh\ < -\/3 and \dh\ < -\/3. 

First we suppose that 9 < 90°, then one can easily check that the point h on 
edge vv' with \vh\ = 2 — -\/3 satisfies the requirement. 

Now we suppose that 90° < 0 < 120°. By (2), we know that Ibv'v < 360° — 
120° — 0 = 240° — 9. We choose h to be the point on edge vv' such that \vh\ = 





(b) 



Fig. 1. (a) The original tree, (b) The modified tree. 
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•\/3 — \dv\. By (1), \vh\ > 0. It is eay to see that 

\v'h\ < l-\/3 + V2-2cos6». (3) 

By triangle inequality, 



\dh\ < |ciu| + \vh\ = -\/3. 



(4) 



Using (3), we have 

|6/ip = \bv'\'^ + \v'h\^ — 2|&w'||u'/i| cos Lhv'v 

6 —6 

< l + (l-y3 + 2sin-)2 + 2(l- V3 + 2sin-)sin(6» + 30°) = G{9) 



Set H{9) = G{9)-{y/iy = -2+(l-y3+2sin f)2+2(l-V3+2sin f) sin(6»+ 
30°). 

One can verify directly that ^^{9) > 0 for 90° < 9 < 120°. This to- 
gether with the fact that ^f-(90°) < 0 and ^f^(120°) > 0 indicates that 
0(0) < max{0(9O°), 0(120°)} < 0 for 0 G [90°, 120°]. So, ^(0) is 
strictly decreasing on [90°, 120°], and then we have for 0 G [90°, 120°], ^(0) > 
^(120°) > 0. This means that H{9) is strictly increasing on [90°, 120°]. So, 
H{9) < iJ(120°) = 0. Combine this with (4), h is certainly a correct choice. 

Case 2. Some edges below v have length greater than 1. Let u be a steiner 
point which is a child of v and has degree 3, x and y the two terminals connected 
to u. 

Without loss of generality, suppose that \ux\ < \uy\, ux and uy have I and 
k Steiner points (both including u), respectively. Let 2: be the point on uy such 
that \uz\ = 1. then, we can assume that zy contains k — l Steiner points(including 
z), and ux and uz contain totally 2{l — 1) Steiner points(not including u and z). 
We directly connect x and z and equally insert [1.155^] — 1 Steiner points into 
xz. Then, the length of each edge on xz is 



21 

[1.115Z] 



< Vs. 



After that, we have still 2(Z—1) — ([1.115Z] —1) = 2Z— 1— [1.115/] = [0.885/J— 1 
Steiner points which can be used to equally break ux into small edges. Then, by 
inserting [0.885/J — 1 Steiner points into ux, each edge on ux has length at most 



I 

[0.885/J 



< Vs if / > 3. 



By this operation, u is changed into a vertex of degree 2 in the new tree, then 
we can continue the process with n — 1 terminals in P U {uj \ {x, y}. 

The case when / < 2 can be proved by using almost the same argument as 
above. We leave it to the interested readers. I 
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Our algorithm for finding the approximation of an optimal 3-restricted Steiner 
tree is just the same as that of [9] . It is based on the following theorem about the 
minimum spanning tree for a weighted 3-hypergraph. A hypergraph H = {V, F) 
consists of a set ^(called vertices) and a set J^(called edges) which is an arbi- 
trary family of subsets of V. A weighted hypergraph FI = {V, F, w) is a hypergraph 
such that each edge e in F has a weight w{e). An r -hypergraph Flr{V, F, w) is a 
weighted hypergraph, each edge has cardinality at most r. 

Theorem 2 [6] There exists a randomized algorithm for the minimum spanning 
tree problem in 3-hypergraphs running in poly{n,Wmax) time with probability at 
least 0.5, where n is the number of nodes in the hypergraph and Wmax is the 
largest weight of edges in the hypergraph. 

We construct a weighted 3-hypergraph Ff^fV, F, w) from the set P of termi- 
nals. Here V = P, and F = {(a, 6)|a G P and b G P} U {(a, 6, c)|a G P and b G 
P and c G P}. To obtain the weight of each edge in F, we need to know B, the 
length of the longest edge in an optimal solution for BSTP. It is hard to find the 
exact value of B. However, we can find a B' that is at most (1-1- e)B for any e 
in time po?t/(n, e) (interested reader can find the detail in [9]). 

Theorem 3 For any given e, there exists a randomized algorithm that eomputes 
a Steiner tree with n terminals and k Steiner points such that the longest edge 
in the tree is at most -\/3 -I- e times of the optimum running in \ x poly{n, k) 
time with probability at least 0.5. 

3 A Ratio-3 Approximation Algorithm for STP-MSP 

In this section, we consider the STP-MSP problem. Given a set P of n terminals 
in the Euclidean plane, and a positive constant R, we want to find a Steiner 
tree with minimum number of Steiner points such that each edge in the tree has 
length at most R. In [1], Chen et al. presented an 0(n^)-time approximation 
with performance ratio at most 3. With a slightly modification, we may reduce 
the running time to O(n^). 

Our algorithm is given in Figure 2. 

Since we construct 3-stars in Step 2, the algorithm runs in O(n^) time. 

For a given set P of terminals, a minimum spanning tree is a tree intercon- 
necting the terminals in P with edge between terminals. For a given constant R, 
a steinerized minimum spanning tree is a tree obtained from a minimum span- 
ning tree by inserting — 1 Steiner points to break each edge ah into small 

pieces of length at most R. 

Let T be a Steiner tree and e be a line segment. C{T) and C(e) denote the 
numbers of Steiner points in T and e, respectively. |e| denotes the length of e. 

Lemma 4 . [1] Every steinerized minimum spanning tree has the minimum num- 
ber of Steiner points among steinerized spanning trees. 
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Algorithm A: 3-approximation Algorithm for STP-MSP 

Input A set P of n terminals, a positive constant R. 

Output A Steiner tree Ta in which each edge has length at most R. 

0. Sort all possible edges between the n terminals of P in length 

increasing order ei, 62 , • • • , e„(„^) , and set Ta ~ 0 ; 

2 

1 . for every a such that \ei\ < R do 

if Ci connects two different connected components of Ta 
then put Ci into Ta', 

2 . for each subset of three terminals a, b, c respectively in three connected 
components of Ta do 

if there exists a points s within distance R from a, b and c 
then put the 3-star, consisting of three edges sa, sb, sc into Ta; 

3. for every a do 

if Ci connects two different connected components of Ta 
then put Ci into Ta- 



Fig. 2. The ratio-3 algorithms. 



Lemma 5 [4] There exists a shortest optimal Steiner tree T* for STP — MSP 
such that every vertex in T* has degree at most five. 



Lemma Q [1] Let T* be a shortest optimal tree for STP-MSP such that every 
Steiner point has degree at most five. LetTj be a full component ofT* . Then the 
following hold: 

(1) The steinerized minimum spanning tree on terminals in Tj has at most 
3 • C{Tj) -b 1 Steiner points. 

(2) IfTj contains a Steiner point of degree at most four, then the steinerized 
minimum spanning tree on terminals in Tj has at most 3 • C(Tj) Steiner points. 

(3) If the steinerized minimum spanning tree on terminals in Tj has an edge 
(of length at most R) between two terminals, then it contains at most 3 • C(Tj) 
Steiner points. 

From (3) of Lemma 6, we know that if the number of Steiner points contained 
in a steinerized spanning tree on terminals in a full component Tj reaches the 
upper bound 3 • C{Tj) -\- 1, then any two terminals are not connected directly 
by a single edge of length at most R, i.e., there must be a Steiner point between 
them. 

Theorem 7 Let T* be an optimal tree for STP-MSP and Ta an approximation 
produced by Algorithm A. Then C(Ta) < 3C(T*). 

Proof. Let T^ be a steinerized minimum spanning tree on all terminals, and let 
k be the number of 3-stars produced by Step 2 of Algorithm A. Then 

C{Ta) < C{T^) - k. 
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By Lemma 5, we assume that each Steiner point of T* has degree at most 
five. Assume that T* has h full components Ti, T2 , . . . , T^. For i = 1,2, let 
be the components produced by Step i of Algorithm A. We construct a 
steinerized spanning tree T as follows: Initially, set T := then for each full 
component Tj (1 < j < /i), add to T the steinerized minimum spanning tree 
Hj on terminals of Tj, if the resulted tree has a cycle, then destroy the cycle by 
deleting some edges of Hj . Without loss of generality, suppose that Ti , T2 , . . . , Tg 
{g ^ h) are the full components in T* such that every Steiner point has degree 
five and UTg has no cycle. Combining Lemma 4 and Lemma 6 with the fact 
that for destroying a cycle from T U Hj , a Steiner point must be removed unless 
Hj contains an edge between two terminals, we have 

C'(T^) < C{T) < 3C{T*) + g, 

i.e., 

C{TA)<3C{T*)+g-k. 

Suppose that has p components. Then, has p — 2k components 
Cl, C2 , . . . , Cp- 2 k- Now we construct another graph H on all terminals as follows: 
Initially put all edges of into H, then consider every Tj {I < j < g). If Tj has 
a unique Steiner point(this Steiner point connects five terminals which must lie 
in at most two C[s), then among the five terminals there are three pairs (edges) 
of terminals, each pair (edge) lies in the same Ci. We add the three edges into 
H . If Tj has at least two Steiner points, then there are two Steiner points each 
connecting four terminals, and we can also find three pairs (edges) of terminals 
such that each pair (edge) lies in the same Ci. Thus, we can add the three edges 
into iL. It is clear that H has at most p — 5g components. Since each components 
of H is contained by a Ci, we have p — 2k < p — 5g, then g — k < ^ — k < 
This ends the proof. I 



4 2. 5- Approximation of STP-MSP 

In this section, we give a randomized algorithm of ratio- 1 for the STP-MSP 
problem. Following are some useful Lemmas. 

A path (?2 • ■ • j <Zm in a tree T is called a convex path if for every i = 
1, 2, . . . , TO — 3, giQi +2 intersects gi+igi+3. An angle of degree more than 120° is 
called a big angle. An angle of degree less than or equal to 120° is called a small 
angle. 

Lemma 8/1/ Let qiQ 2 . . . ,qm he a convex path. Suppose there are b big angles 
among to - 2 angles Zgi9293, Zg293?4, ■ • ■ , Zgm-2g'm-i9m- Then, \qiqm\ < {b + 
2)R. 

Note that if there is no small angle, \q\qm\ < {b+l)R. Thus, this lemma is 
useful only when there are many small angles in the convex path. 
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Lemma 9 [1] In a shortest optimal tree T for STP — MSP, there are at most 
two big angles at a point of degree three, there is at most one big angle at a point 
of degree four, and there is no big angle with degree five. 

Let T* be a shortest optimal tree for STP — MSP on n terminals which is 
a full Steiner tree. Let Sj denote the number of Steiner points of degree z in T*. 

Lemma 10 [1] 3ss + 2 s 4 + S 3 = n — 2. 



Lemma 11 Consider a clockwise tour F ofT* that visits the n terminals in the 
order t\,t 2 , ■■■ ,tn,ti. (See Figure 3.) Then, 

(i) the tour F has exactly n convex paths Pi,P 2 , . . . , such that Pi connects 
two terminals U and ti+i (tn+i = t\); 

(ii) each angle at a Steiner point appears in those n convex paths exactly once. 
(Hi) connect the two ends of Pi by an edge Ci and then steinerized Ci, i = 

1 , 2 , . . . , n, the total number of Steiner points in any n—1 Ci ’s C= C{ei) 
is upper bounded as follows: 

C Si F 2 s 3 + 2s2 F n — C{cn) /-\ 

= 3(s5 + S4 + S3) + 2s2 + 2 — C(en) 

We denote by Tp the tree consisting ofn terminals and (n— 1 ) edges 64,62, . . . , 

^n— 1 ■ 

Proof, (i) and (ii) are very easy to see from the structure of T*. Now, we prove 

(iii) . Consider the tour F. By Lemma 8, if there are 04 big angles in Pi, then 

there are at most 0^ + 1 Steiner points on Cj, and so the total number of Steiner 
points in F is at most n plus the number of big angles in T* . By Lemma 9 , there 
are at most 2s2 + 2s3 + S4 big angles in T*. From Lemma 10 , we know that (iii) 
is valid. I 

Let T* be a shortest optimal tree for STP-MSP which is a full Steiner tree 
on n terminals. Without loss of generality, we assume that T* has Steiner points 




Fig. 3. The tour F that visits all the terminals 
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of degree at least three. Selecting an arbitrary Steiner point of degree at least 
three as the root of T*, we get a rooted tree. A good point t in T* is a Steiner 
point that is adjacent to some terminals and satisfies one of the following: 

(i) (type (1)) t has three or more terminals as children; 

(ii) (type (2)) t has two terminals as children and the degree of t is 4; 

(iii) (type (3)) t is a point of degree 3. 

Note that a good point is of degree at least 3. A had point is a Steiner point 
of degree at least 3 in T* that is not a good point. 

Theorem 12 There is a 3-restricted Steiner tree such that each edge has length 
at most R and the Steiner points is at most | times the optimum. 

Now, we focus on the computation of an optimal 3-restricted tree. 

Let Hs{V,F, W) be a weighted 3-hypergraph, where V = P, F = {(a, 6)|a G 
V and b G V} U {{a,b,c)\a G V,b G V and c G V}, and for each edge e G F, 
w{e) is the smallest number of Steiner points to form an optimal solution of the 
STP-MSP problem on the terminals in e. 

Given three points a, b and c on the on the Euclidean plane, let s be the 
Steiner point which minimizes (|sa| -I- |s&| -I- |sc|), and let k be the number of 
Steiner points in an optimum solution T of STP—MSP on {a, 6, c} with constant 
R. 



Lemma 13 



,-|saL rIscL „ . , Isal , , |s6| , , Iscl , „ 

- 2 > fc > L Vj + lVJ + LVJ - 2. (6) 

it it it it it it 



Proof. By steinerizing the optimum Steiner tree, we get a solution of STP — 
MSP on {a,b,c} with exactly |"^1 -f [^1 + ~ 2 Steiner points. 

Let |T| be the total length of T, which is the sum of the length of edges of 
T. Then 

(fc — 1) • i? -I- 3R > \T\ > |sa| -I- |s6| -I- |sc|. 



i.e.. 



k ‘2 P 



|g^l + |gc| ^ ^ ^|s6| j j 

R R R R R 



Therefore, (6) holds. 

Lemma 13 gives an upper bound on the cost of {a,b,c). 



I 



Lemma 14 [9] Testing whether three circles has a point in common can be done 
in constant time. 



For any given points a, b and c on on the Euclidean plane, one can find the 
minimum Steiner tree on {a,b,c} in constant time. Let qa,b,c be the number of 
Steiner points used to steinerize the optimum Steiner tree on {a, b, c}, and qp = 
max{qa^b,c\{o.,b, c} C P}. Then, by Lemma 13 and Lemma 14, the weight W of 
H^{V, F,W) can be calculated in 0{n^qp) time. By Theorem 2 and Theorem 
12, we have 
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|-approximation Algorithm for STP-MSP 

Input A set P of n terminals in the Euclidean plane, a positive constant R 
Output A 3-restricted Steiner tree T in which each edge has length at most R. 

1. Construct a weighted hypergraph Hs{V, F,w). 

2. Call the randomized algorithm in [6] to compute a minimum spanning 
tree T for H 3 CV, F, w); 

3. Replace every edge / of the MST T' of H^iV, F,w) with a Steiner tree 
with w{f) Steiner points such that the maximum length of each edge in 
the tree is at most R and output the obtained tree. 



Fig. 4. The complete algorithm. 



Theorem 15 Given a set P of n terminals and a positive constant R, there 
exists a randomized algorithm that computes a solution of STP — MSP on P 
such that the number of Steiner points is at most | times of the optimum running 
in poly{n, qp) time with probability at least 0.5. 

The complete algorithm is given in Figure 4. 
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Abstract. In this paper, we study the problem of computing a minimum 
cost Steiner tree snbject to weight constraint in a series-parallel graph 
where each edge has a nonnegative integer cost and a nonnegative integer 
weight. We present a fully polynomial time approximation scheme for this 
NP-complete problem. 



1 Introduction 

A computer network is often modeled by an undirected graph G = (V, E) where 
V is the set of vertices and E is the set of edges. The traditional Steiner tree 
problem associates an edge cost c(e) > 0 with each edge e £ E and asks for a 
minimum cost subgraph of G spanning a given subset E Q V of target vertices. 
Such problems find important applications in computer networks and have been 
studied by many researchers [1,4,10,13,15]. We refer the readers to [8,11] for 
details. 

In this traditional network model, there is only one cost corresponding to 
each edge. In real-life problems, each edge may have a weight besides a cost, and 
we want to find a minimum cost subgraph of G spanning the vertices in E with 
a total weight no more than a given weight constraint W. We call this problem 
the weight constrained minimum cost Steiner tree problem (WGSTP). 

Formally, we generalize the traditional network model to allow two indepen- 
dent edge weighting functions: with each edge e £ E, there is an associated 
integer cost c(e) > 0 and an associated integer weight w(e) > 0. Let G' be a 
subgraph of G. The cost of G', denoted by c(G'), is the sum of the edge costs of 
G'. The weight of G', denoted by w(G'), is the sum of the edge weights of G'. 
Given a set of target vertices E QV and an integer weight constraint W > 0, we 
are interested in computing a minimum cost tree subgraph T of G which spans 
the vertices in T subject to the constraint that w{E) < W. We call such a tree 
a weight constrained minimum cost Steiner minimum tree. 

When w{e) = 0 for every edge e £ E, the WGSTP becomes the well known 
Steiner tree problem. Since the Steiner tree problem is NP-hard in the strong 
sense [6], the WGSTP is also NP-hard in strong sense. In this paper, we are 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 519-528, 2001. 
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interested in WCSTP on a very important class of sparse networks known as 
series-parallel graphs (S-P graphs), which are subgraphs of 2-trees defined below. 

Following [13], a 2-tree can be defined recursively as follows, and all 2-trees 
may be obtained in this way. A triangle (a complete graph on three vertices) is a 
2-tree. Given a 2-tree and an edge {x, y} of the 2-tree, we can add a new vertex 
z adjacent to both x and y, the result is a 2-tree. A series-parallel graph (also 
known as a partial 2-tree) is a subgraph of a 2-tree. With this definition, one can 
see that a 2-tree on n vertices has 2n — 3 edges and n — 2 triangles. 

Farley [5] demonstrated that 2-trees are isolated failure immune (IFI) net- 
works. Wald and Colbourn [13] showed that a minimum IFI network is a 2-tree. 
This fact made 2-trees an important class of fault tolerant networks. Wald and 
Colbourn [13] also showed that the Steiner tree problem on a series-parallel graph 
is tractable, presenting a linear time algorithm. In a recent paper, Colbourn and 
Xue [3] showed that the grade of service Steiner tree problem [16] is also tractable 
on a series-parallel graph. 

In this paper, we show that the weight constrained minimum cost Steiner tree 
problem is NP-hard even when the graph G is a series-parallel graph. We then 
present a fully polynomial time approximation scheme for computing a weight 
constrained minimum cost Steiner tree in a series-parallel graph. 

2 WCSTP in S-P Graphs Is Intractable 

When the number of target vertices is zero or one, the WCSTP is trivial. When 
there are only two vertices in T, the WCSTP becomes the well-known weight 
constrained shortest path problem (WCSPP) which is known to be NP-hard [6]. 
To the best of our knowledge, no one has ever addressed the complexity of WC- 
STP or WCSPP in an S-P graph specifically. In a recent paper [14], Wang and 
Crowcroft presented an NP-hardness proof for the WCSPP problem in general 
graphs. We point out that the NP-hardness proof of Wang and Crowcroft, al- 
though targeted at the problem in general graphs, also proves that the WCSPP 
is NP-hard in an S-P graph. Since the WCSPP in an S-P graph is NP-hard, the 
more general WCSTP in an S-P graph is also NP-hard. Therefore we have the 
following hardness result (for the completeness of the current paper, we include 
an NP-hardness proof of the WCSPP in an S-P graph in the appendix). 

Theorem 1. The WCSTP in an S-P graph is NP-hard. □ 

Note that in the definition of the WCSTP, the weight and cost of an edge 
are symmetric. Therefore we may also talk about cost constrained minimum 
weight Steiner tree problem (CCSTP) which asks for a minimum weight tree 
subject to cost constraint. Given the hardness of the problem, we are interested 
in designing efficient approximation algorithms for this problem. In the next 
section, we will present a pseudo-polynomial time algorithm for computing a 
cost constrained minimum weight Steiner tree in an S-P graph. We will then 
apply standard techniques of scaling and rounding to turn the pseudo-polynomial 
time algorithm into a fully polynomial time approximation scheme (FPTAS) for 
weight constrained minimum cost Steiner trees. 
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3 A Pseudo Polynomial Time Algorithm for CCSTP 

In this section, we will present a pseudo polynomial time algorithm for computing 
a cost constrained minimum weight Steiner tree. Given the set T C G and a 
nonnegative integer C, the algorithm computes a minimum weight Steiner tree 
among those whose cost is bounded by in O(nC^) time, where n is the number 
of vertices in the series-parallel graph. 

Our algorithm for computing a weight-constrained Steiner tree operates in 
two phases. The first completes the graph to a 2-tree, as was done in [13], where 
each added edge has a weight of zero and a cost that is larger than the cost 
constraint C. This ensures that the added edges will never be chosen in a cost 
constrained minimum weight Steiner tree. 

The second phase involves finding cost constrained minimum weight Steiner 
trees in 2-trees. 

We are given a 2-tree G with target set T G V . With each (directed) arc 
a = {x, y) corresponding to an (undirected) edge {x, y} of G, we will associate 
6C + 6 measures, which summarize the weight incurred so far in the subgraph S 
which has been reduced onto the edge {x,y}: 

— st{a, is the minimum weight of a Steiner tree for S, with a cost of at most 

in which x and y appear in the same tree, ^ = 0, 1, 2, . . . , C. 

— dt{a,^) is the minimum weight of two disjoint trees for S including all tar- 
gets, one tree involving x and the other involving y, whose total cost is no 
more than ^ = 0, 1, 2, . . . , ^. 

— yn{a, is the minimum weight of a Steiner tree for S, which includes x but 
not y, whose cost is at most ^ = 0, 1, 2, . . . , (^. 

— ny{a, is the minimum weight of a Steiner tree for S, which includes y but 
not x, whose cost is at most ^ = 0,1,2,...,^. 

— nn{a, is the minimum weight of a Steiner tree for S, which includes neither 
X nor y, whose cost is at most ^ = 0 , 1, 2, . . . , ^. 

— none{a,^) is the weight of omitting all vertices of S from a Steiner tree 
whose cost is at most ^ = 0, 1, 2, . . . , ^ (note that there is a big penalty 
here is S contains any target vertices). 

Let f3 = (x,y) be the other arc corresponding to the edge {x,y}. Then the 
6C + 6 measures associated with /3 relates to the measures associated with a in 
the following way: 



st{f3,0 = 

ny{fd,0 = yn{a,C) nn{f3,^) 



dt{a,i) yn{(3,C) 

nn{a,^) none{(3,^) 



ny{a,C) 
none{a, ^) 



Let BIG be an integer which is bigger than the sum of all the edge weights in 
the given graph. Initially, no reduction of the graph has been done. We set the 
initial measures for each arc a = (x,y) corresponding to an edge e = {x,y} as 
follows, for ^ = 0,1,2,..., 

Our algorithm (call it Algorithm 1) will use these initial arc measures to 
recompute the arc measures as the graph is reduced. The graph is reduced by 
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Table 1. Initialization of arc measures 



Initialization Of Measures 
st{a,^) = w{e) if c(e) < BIG otherwise; 
dt{a,^) = 0; 

yn{a,^) — BIG y ^T, 0 otherwise; 
ny{a,0 = big if a; G T, 0 otherwise; 
nn{a, = BIG if X (zT or y (zT, 0 otherwise; 
none(a,^) — BIG if x G T or y G T, 0 otherwise. 



repeated deletion of degree-2 vertices. The algorithm stops when there is no 
degree-2 vertex left. Suppose at some point in the algorithm there is a triangle 
{x, y, z} in which j/ is a degree-2 vertex and that costs have been computed for 
both (x,y) and (y,z). Let L = (x,y), R = {y,z), M = {x,z). We will update 
the measures at M using the following rules: 

— For ^ = 0, 1, 2, . . . , C, let w^^st be the minimum of the following four sets of 
numbers: 

{st{M,^M) + st{L,^L) + dt{R,^R)\S,L + ^R + in = > 0 ,^r > 0,^m > 0}; 

{st{M,^M) + dt{L,^L) + st{R,^R)\^L + ^R + > 0,^R > 0,^M > 0}; 

{st{M,^M) + yn{L,^L) + ny{R,^R)\^L + Ck + Cm = CjCl > 0;Cr > 0)Cm > 0}; 

{dt{M,^M) + st{L,^L) + st{R,^R)\^L + Cfl + Cm = C>Cl > 0:Cfl > 0 )Cm > 0}. 

— For C = 0, 1, 2, . . . , C, let be the minimum of the following three sets of 
numbers: 

{dt{M,^M) + st{L,^L) + dt{R,^n)\^L + Ci? + Cm = CiCl > 0)Cfl > OjCm > 0}; 
{dt{M,^M) + dt{L,^L) + sf(i?,Cfl)|Ci + Cr + Cm = C)Cl > O^Cfl > 0 :Cm > 0}; 
{dt{M,^M) + yn{L,^L) + ny(i?,Cfi)|Ci + Cfl + Cm = CjCl > 0;Ck > 0)Cm > 0}. 

— For C = 0, 1, 2, . . . , C, let w^^yn be the minimum of the following two sets of 
numbers: 

{yn{M ,Cm) + yn{L,^L) + none(i?, C_ r) |Cl + Cfl + Cm = CjCl > OjCr > OjCm > 0}; 
{yn{M,^M) + st{L,^L) + 2/n(i?,C_R)|Ci + Cfl + Cm = CjCl > O^Cfi > 0 )Cm > 0}. 

— For C = 0, 1, 2, . . . , C, let w^^ny be the minimum of the following two sets of 
numbers: 

{ny{M,^M) + ny{L,^L) + st{R,^R)\^L + Cfl + Cm = CjCl > 0)C« > 0;Cm > 0}; 
{ny{M,^M) + none{L,^R) + ny(i?, C_ r)|Cl + Cfl + Cm = CjCl > OjCr > 0>Cm > 0}- 

— For C = 0, 1, 2, . . . , C, let w^^nn be the minimum of the following four sets of 
numbers: 



{nn{M,^M)+none{L,^L)+none{R,^R)\S,L+S,R+iM = CjCl > OjCr > OjCm > 0}; 
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{none{M,^M)+nn{L,^L)+none{R,^R)\^L+^R+iM = > 0 ,^r > 0 ,^m > 0 }; 

{none{M,^M)+none{L,^L)+nn{R,^n)\^L+^R+^M = > 0 j'?m > 0 }; 

{none(Af,^M) + ny{L,^L) + 2/?^(-R,C-r)ICl + Cfl + Cm = C>Cl > 0>Ci? > 0 >Cm > 0}. 

— For C = 0)1)2,...,C, let w^^none be the minimum of the following set of 
numbers: 

{none(M,CM) + ?t^one(L,CL) + none(i?,Cfl)|CL + Cfl + CM = C,Cl > O.Cfl > 0 ,Cm > 

0}. 

— Delete the vertex y from the graph. For C = 0, 1, 2, . . . , C, update the values 

of st{M, C), dt{M, C), yn{M, C), ny{M, C), nn{M, C), and none{M, C) to w^^st, 
w^,dt, Wi,yn, w^^ny, and w^^none, respectively. 



Theorem 2. When the 2-tree is reduced to a single edge a = {x,y}, the min- 
imum of the four values st{a,Cf), yn{a,Cf), ny{a,Cf), nn{a,Cf) is the minimum 
weight of a Steiner tree interconnecting all the vertices in T with a cost not 
more than f, where a weight o/BIG or more indicates the nonexistence of a cost 
constrained Steiner tree. 

Proof. We note that the measures of an edge a = {x,y} are initialized cor- 
rectly, and are updated correctly after each reduction. For example, for any 
C G {0, 1, 2, . . . , C} and an edge a = {x, y}, st(a,^) is the minimum weight of a 
Steiner tree for S whose cost is no more than C, in which x and y appear in the 
same tree. 

When the graph is reduced to a single arc a = {x,y}, st{a,f) is the mini- 
mum weight of a cost constrained Steiner tree interconnecting x, y, and all the 
vertices in T; yn{a, Cf) is the minimum weight of a cost constrained Steiner tree 
interconnecting x (but not y) and all the vertices in T — {y}; ny{a,Cf) is the 
minimum weight of a cost constrained Steiner tree interconnecting y (but not 
x) and all the vertices in T — {a:}; nn{a,Cf) is the minimum weight of a cost 
constrained Steiner tree interconnecting y (but not x) and all the vertices in 
T — {x, y}. Note that leaving out a target vertex from the tree receives a penalty 
of at least BIG. Therefore the minimum of the above four values corresponds to 
the minimum weight of a cost constrained Steiner tree. □ 

Theorem 3. Algorithm 1 can he implemented in 0{nCf^) time, where C, is the 
cost constraint and n is the number of vertices in the graph. 

Proof. We will show that the update of measurements can be accomplished 
in O(C^) time for each reduction. Since there are 0{n) reductions, this would 
complete the proof of the theorem. 

Since there are 0(() choices for each of f, and fr {f,R = f — f,M — £,l), a 
simple minded implementation would require O(C^) time to update the measures 
for each reduction. 
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Let us denote by ^lr- Although there are O(C^) different choices for 

{^lAr)j there are only 0{() different choices for ^lr- Using O(C^) time, we can 
compute the following 12(^ + 1) measures for all ^lr G {0,1,2,..., ^}. 

LR{^lr, st, st) = min{st(L,^z,) + st{R,^LR ~ ■?l)|0 < < ^lr}; 

LR{iLRi st, dt) = min|st(L, + dt{R, |0 < < ^lr}', 

LR{^lr, dt, st) = mm{dt{L, ^l) + st{R, ^lr ~ S,l) |0 < < ^lr}] 

LR{^LR,yn,ny) = min{yn(L, ^l) + ny{R,^LR ~ Cl)|0 < < ^lr}; 

LR{^lr, yn, none) = mm{yn{L, ^l) + none{R, ^lr ~ £,l) |0 < < S,lr}', 

LR{^lr, st, yn) = mm{st{L, ^l) + yn{R, S,lr - ?l) |0 < Cl < Clk}; 
LR{^lr, ny, st) = mm{ny{L, Cl) + st{R, ^lr - Cl) |0 < Cl < Cla}; 
LR{^lr, none, ny) = mm{none{L, Cl) + ny{R, ^lr ~ Cl) |0 < Cl < Cla}; 

LR{^lr, none, none) = min|none(L, Cl) + none{R,^LR ~ Cl) |0 < Cl < Cla}; 

LR{^lr, nn, none) = min{nn(L, Cl) + none{R, ^lr ~ Cl) |0 < Cl < Cla}; 

LR{^lr, none, nn) = min{none(L, Cl) + nn{R, ^lr ~ Cl) |0 < Cl < Clk}; 

LR{^LR,ny,yn) = mm{ny{L,^L) + yn{R,^LR ~ Cl) |0 < Cl < Cla}- 

After those 12(C + 1) measures are computed, only O(C^) time are neces- 
sary to compute the updated values for st(M, C), dt{M,^), yn{M,^), ny{M,^), 
nn{M, C), none{M,^), for C = 0, 1, 2, . . . , C. For example, w^^yn can be computed 
as the minimum of 

min{yn(M,CM) + LR{^ - ^M,yn,none)\Q < Cm < Cl 

and 

min|i/n(M,CM) + Li?(C - Cm, st.yn) |0 < Cm < Cl- 

This shows that Algorithm 1 can be implemented in O(nC^) time. □ 

We point out that the weight constrained Steiner tree can be constructed in 
0(nC) extra time time if we perform some bookkeeping operations (recording 
how the minimum was achieved) during the reductions. 



4 An FPTAS for WCSTP 

We use standard techniques of scaling and rounding [2,7,9,12,17] to turn the 
pseudo polynomial time algorithm for CCSTP into a fully polynomial time ap- 
proximation scheme for WCSTP. 

Let us use c(T, W) to denote the minimum cost of a Steiner tree spanning the 
targets in T with a weight of no more than W. Given a positive real number C, 
deciding whether c{‘T,yV) > C or c{‘T,W) < C is NP-hard. Using the standard 
technique of scaling and rounding [2,7,9,12,17], we can decide, in fully polynomial 
time, whether c(T, W) > C or c(T, W) < (1 -I- e)C, for any given constant 
e > 0. This technique will play an important role in our FPTAS for computing 
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a weight constrained minimum cost Steiner tree in an S-P graph. We describe 
this approximate testing in Algorithm 2 as TEST . 

Algorithm 2 TEST (C, e) 

Step_1 Set 6 := Let eg be the scaled edge cost function such that ce(e) = [c(e) x 6\ 

for e € E\ Set ( := C x 9; 

Step_2 Apply Algorithm 1 using the scaled edge cost function ce instead of the original edge 
cost function c; 

if the weight of the cost constrained Steiner tree is no more than W then 
output YES ; 
else 

output NO ; 
endif 



Theorem 4. Let us be given the target set T, the weight constraint W, the 
positive real numbers C and e. //TEST(C,e) = NO, then c{T,W) > C. If 
TEST(C,e) = YES, then c{T,yV) < (1 -I- e) x C. In addition, the worst-case 
time complexity o/TEST(C,e) is 0{^). 

Proof. Let T be a tree subgraph in G. Let c(T) = ^(e) and cg{T) = 

X^eGT ce(e). Since the number of edges in T is at most 2n — 3, we can prove that 
ce{T) < C implies c(T) < C(1 -I- e) and that cg{T) > f implies c(T) > C, where 
0 and C are as defined in Algorithm 2. 

Assume that TEST(C, e) = NO. Then we know that for any tree T spanning 
the targets in T with a weight no more than fV, we must have cg(T) > which 
in turn implies c(T) > C. This says that c(T,W) > C. 

Now assume that TEST(C,e) = YES. Then we know that there is a tree 
T spanning the targets in T such that w{T) < >V and cg{T) < f. Note that 
cg{T) < C, implies c(T) < C(1 -I- e). This says that c(T,W) > C(1 -I- e). 

The time complexity of TEST follows directly from Theorem 3. □ 

We will use LB and UB to denote lower and upper bounds on c(T, W). Our 
FPTAS starts with efficiently computable values of LB and UB and uses bisection 
to reduce the ratio 

An initial value of LB can be computed as follows. Use Algorithm 1 to com- 
pute a minimum weight Steiner tree T whose cost is 0. This clearly can be done 
in polynomial time. If the weight of this tree is no more than W, it is also 
a weight constrained minimum cost Steiner tree and we are done. Otherwise, 
c(T, W) > 1 and we can use 1 as the initial value of LB. 

An initial value of UB can also be computed easily. Use the algorithm in 
[13] to compute a minimum weight (ignore the cost) Steiner tree T spanning 
the target vertices in T. If w{T) > W, there is no Steiner tree whose weight is 
bounded by W. When w{T) < W, we may use c{T) as the initial value of UB. 

Let B be some chosen real number which is grater than l-|-e/3. We will apply 
bisection to drive the ratio ^ down to some number below B. Suppose that our 
lower bound LB and upper bound UB are such that UB > LBxB > LBx (l-|-e/3). 

Let C = ^+e /3 ■ If TEST(C, e/3) = NO, then C is also a lower bound for c(T). 
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If TEST(C,e/3) = YES, then (1 + e/3) x C is also an upper bound for c(T). 
Therefore, the ratio of the new upper bound over the new lower bound will be 

no more than x (1 + e/3). Let us call the above process an iteration. Note 
that such an iteration can be accomplished in fully polynomial time (by Theo- 
rem 4). Furthermore, the ratio of the upper-bound over the lower bound can be 
reduced to a number below B in polynomial number of iterations (polynomial 
in the input size of the given instance and i). This analysis leads to out FPTAS 
which is given in Algorithm 3. 



Algorithm 3 FPTAS for weight constrained minimum cost Steiner tree in an S-P graph. 
step.1 set B := (1 -I- e/3) x (1 -|- e/3); 

set LB and UB to their initial values; 

Step^ if UB < B X LB then 
goto Step J; 
else 

if TEST(C, e/3) = NO, set LB = C; 
if TEST(C, e/3) = YES, set UB = C x (1 -I- e/3); 
goto StepJZ; 
endif 

Step.3 set e := il^; c,-.= 6x UB; 

setc«(e) := [6 x c(e)J for every e 6 if; 

apply Algorithm 1 to compute a cost constrained minimum weight Steiner tree using 
the scaled cost function eg . 



Theorem 5. If there is no weight constrained Steiner tree spanning the target 
vertices in T, we will find this out during our computation of the initial value 
of UB. If T has a weight constrained Steiner tree, Algorithm 3 finds a weight 
constrained Steiner tree T such that w{T) < W and c{T) < (1 -L e) x c{T,W). 
Furthermore, the time complexity of Algorithm 3 is x (log5 -I- log(i)), 

where S is the input size of the given instance. 

Proof. The claim follows from standard techniques [2,7,17] and Theorems 2- 
4. □ 

5 Conclusions 

In this paper, we have studied the weight constrained minimum cost Steiner 
tree problem on a very important class of sparse graphs-series-parallel graphs. 
Although most Steiner tree problems are tractable on series-parallel graphs, it 
is shown that the weight constrained minimum cost Steiner tree problem on 
series-parallel graphs is NP-hard. On the positive side, we have presented a fully 
polynomial time approximation scheme for this problem, which has applications 
to computer networks as well as computational biology. A more challenging prob- 
lem is the weight constrained minimum cost Steiner tree problem on a general 
graph. 
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A NP-Completeness of WCSTP in S-P Graphs 

In this section, we prove that the weight constrained shortest path problem [7] 
in S-P graphs is NP-hard. After we have arrived at this proof, we found that 
the proof by Wang and Crowcroft [14] also proves the same fact. This proof is 
included here as an aid to the referees. 

Theorem 6. The weight constrained shortest path problem in series-parallel 
graphs is NP-hard. 

Proof. Now we transform PARTITION to the weight constrained shortest path 
problem in S-P graphs. 

Suppose we have a set S with n elements and every element a has a corre- 
sponding size s(a) S J2aes ~ construct a S-P graph (see Figure 

1) as follows. 

- V = {vi,V2, . . . ,Vn+l,Ui,U2, . . . , Mn}, 

- E = {{vi,Vi+i)\i = 1,2, . . . ,n} U {(ui,Ui), = l,2,...,n}, 

- For e = {vi,Vi+i),c{e) = s{ai),w{e) = 0. 

- For e = or (v^+i,u^), c(e) = 0,w(e) = ^s(ai). 

For this S-P graph, we want to find a shortest path from vi to Vn+i such 
that the total weight of this path is no larger than A. 

If we can find a partition of S' = S 1 US 2 such that “ SaGS 2 “ 

A, then we have a path from v\ to consist of the edges {vi, Ui+i) for £ Si, 
and (ui, Ui), (ui, Ui+i) for £ S 2 . The total cost and total weight of this path 
are all equal to A. This path is obviously the shortest path under the restriction 
of total weight no larger than A. 

Conversely, assume that we have find a shortest path from v\ to Vn+i with 
the total weight no larger than A. From the construction of the S-P graph, we 
can get a partition of S by setting Si = {ai|(ui,Ui+i) is in the shortest path}. 

From above discussion, we know that the weight constrained shortest path 
problem in S-P graph is NP-hard. □ 




Fig. 1. Hardness proof of WCSPP. 
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Abstract. We consider generalized discrete timed automata with gen- 
eral linear relations over clocks and parameterized constants as clock con- 
straints and with parameterized durations. We look at three approxima- 
tion techniques (i.e., the r-reset-bounded approximation, the B-bounded 
approximation, and the {B, r)-crossing-bounded approximation), and de- 
rive automata-theoretic characterizations of the binary reachability un- 
der these approximations. The characterizations allow us to show that 
the safety analysis problem is decidable for generalized discrete timed 
automata with unit durations and for deterministic generalized discrete 
timed automata with parameterized durations. An example specification 
written in ASTRAL is used to run a number of experiments using one 
of the approximation techniques. 



1 Introduction 

As a standard model for analyzing real-time systems, timed automata [3] have 
received enormous attention during the past decade. A fundamental result in the 
theory of timed automata shows that region reachability for timed automata is 
decidable [3] . This result has been very useful in defining various real-time logics, 
appropriate model checking algorithms and tools [2,4,16,17,24,25,27,28,29] for 
verifying real-time systems (see [1] for a survey). 

However, not every real-time system can be modeled as a timed automaton 
since a complex (not necessarily large) real-time system might contain non-region 
(e.g., Presburger) clock constraints. However, the “Turing computing” power of 
such augmented automata prevents automatic verification of properties such as 
reachability [3,5]. Therefore, decidable approximation techniques are of great 
interest, since these techniques would provide a user some degree of confidence 
in analyzing and debugging complex real-time specifications. In contrast to the 
most direct approximation techniques [5,10,11,12] that bound the number of 
transitions to a fixed number, the approximation techniques presented in this 
paper restrict the clock behaviors but do not necessarily bound the number of 
transition iterations to be finite. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 529-539, 2001. 
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In this paper, we focus on timed automata with integer- valued clocks, i.e., dis- 
crete timed automata, and extend them to generalized discrete timed automata 
by allowing general linear relations over clocks and parameterized constants as 
clock constraints. Furthermore, the duration of a transition can be a parame- 
terized constant. These generalizations have practical motivation. For example, 
many complex real-world specifications [6,8,10,22] written in the real-time spec- 
ification language ASTRAL [6] use generalized clock constraints and parameter- 
ized durations in almost every specification. Therefore, the results presented in 
this paper may be useful in implementing an automatic specification debugger 
for complex real-time specification languages like ASTRAL. 

We investigate three approximation techniques in this paper. The r-reset- 
bounded approximation bounds the number of clock resets by a given positive 
integer r for each clock. The R-bounded approximation requires that each clock 
reset happens before its value exceeds a given (integral) time B. Combining these 
two, the (R, r)-crossing-bounded approximation requires that for each clock, 
there are at most r times that the clock resets after its value exceeds R. Given 
an approximation technique, we will focus on the binary reachability character- 
ization of a generalized discrete timed automaton. Binary reachability has re- 
cently been proposed and investigated for a number of timed systems [7,9,14,15], 
which makes it possible to reason about “non-region” properties for these sys- 
tems. We first show that, when a generalized discrete timed automaton has unit 
duration, the binary reachability under any one of the three approximations is 
Presburger. Then by considering a generalized discrete timed automaton with 
parameterized durations, we show that the binary reachability under any one of 
the three approximations has a 2DCM-padding when the machine is determin- 
istic. Specifically, we show that the “padded language” for binary reachability 
can be accepted by a deterministic two-way counter machine with one reversal- 
bounded counter [20]. The case for nondeterministic generalized discrete timed 
automata is open. These are good characterizations in the sense that the validity 
of Presburger formulas as well as the emptiness problem for these counter ma- 
chines are decidable. This allows us to establish, in principle, decidable results for 
the (Presburger) safety analysis problem for generalized discrete timed automata 
under the approximations. The 2DCM-padding characterization is particularly 
interesting, since binary reachability is not necessarily semilinear. 

Due to space limitation, all the proofs are omitted. For a complete exposition 
see [13]. 

2 Preliminaries 

Let P be a finite set of variables over the integers. An atomic linear relation on 
V is defined as Sy^yO'vV < b, where a„ and b are integers. A linear relation on V 
is constructed from a finite number of atomic linear relations using -■ and A. Cv 
denotes the set of all linear relations on V . An atomic Presburger relation on V is 
either an atomic linear relation on P or a linear congruence Sy^yayV = b mod d, 
where ay , b and d are integers. A Presburger formula can always be constructed 
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from atomic Presburger relations using -■ and A. Let N be the set of integers 
with N~^ denoting the set of nonnegative integers. A clock is a variable over 
N~^ . A generalized discrete timed automaton ^ is a tuple {S, C, X, E) where S 
is a finite set of (control) states, X is a finite set of clocks, C is a finite set 
of parameterized constants, and if C S' x (C U {0, 1}) x 2^ x Cxuc x S is a 
finite set of edges. Each edge (s, d, A, I, s') denotes a transition (or an edge) from 
s to s' with enabling condition I G Cxuc- d G C U {0, 1} is a parameterized 
constant indicating the duration of this transition. A C X is the set clocks that 
are reset as a result of this transition. When A is empty, the duration, which is 
a parameterized constant in C or integer constant 1, must be positive. Clock 
resets take no time. Thus, when A is not empty, the duration d on this edge is 
simply 0. 

The semantics is defined as follows, a £ S x x is called 

a configuration with aq being the state under this configuration, and Ocj 
denote the value of the clock Xi and the value of the parameterized constant Cj, 
respectively. Note that each clock and parameterized constant are nonnegative, 
i.e., in X+. a — > q,' denotes a one-step transition along an edge in A 
satisfying 

— Constant values do not change, i.e., for each c G C, «c = Q^c) 

— The state s is set to a new location s', i.e., aq = s,a'q = s', 

— Each clock changes according to the edge given. When there is no clock 
reset on this edge, i.e., A = 0, all the clocks synchronously progress by the 
amount of the duration, i.e., for each x G X, = ax + ad- In this case, the 
duration is positive, i.e., ad > 0. When A yf 0, clocks in A reset to 0 and all 
the other clocks are unchanged. Thus, clock resets take no time. That is, for 
each a: G A, = 0 and for each x ^ \, a'^ = a^- 

— The enabling condition is satisfied. That is, l{a) is true. 

We simply write a ^ a' ii a can reach a' by a one-step transition. A is deter- 
ministic if for any configuration a there is at most one (3 such that a ^ f. A path 
ao - - - ak satisfies ai — >■ Oi+i for each i. Write a /? if a reaches f through a 
path. In the definition of A, there is no input. This is because the input is always 
one-way for timed automata and, therefore, input symbols can be built into the 
control states. A has parameterized durations. If we restrict each duration on an 
edge without clock resets to be 1, then A is called a generalized discrete timed 
automaton with unit durations. 

When A with unit durations contains no parameterized constants and en- 
abling conditions are clock constraints or clock regions in the form of Boolean 
combinations of x — y)(c and x#c, where c is an integer, x and y are clocks and 
4) G {<, =, >}, A is equivalent to the standard timed automata (with integer- 
valued clocks) [3]. On the other hand, when A with unit durations contains 
enabling conditions only in the form of Boolean combinations of x4(c, where c 
is a parameterized constant or an integer, x is a clock (when c is a parame- 
terized constant, x is called a parameterized clock), xl is a parameterized timed 
automaton (with integer-valued clocks) [5]. 
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There are two kinds of reachability for A. They are state reachability and 
binary reachability. Assume a state sq is designated as the initial state of A. A 
state s G S' is state reachable in A if there is an initial configuration ag (whose 
state is sg and whose clock values are all 0) and a configuration a with aq = s 
such that a is reachable from the initial configuration, i.e., «g o:- The state 
reachability set is the set of all states s such that s is state reachable in A. The 
state reachability problem of A is whether a state s G S is state reachable. The 
following are some known results about the state reachability problem. 

Theorem 1. (1). The state reachability problem is decidable for standard timed 
automata [3]. (2). The state reachability problem is decidable for parameterized 
timed automata with only one parameterized clock (but can have many unpa- 
rameterized clocks) [5]. (3). The state reachability problem is undecidable for 
generalized discrete timed automata. 

Actually, Theorem 1 (3) follows from the following special cases. 

Theorem 2. (1). The state reachability problem is undecidable for standard 
timed automata when we allow “+” operations in clock constraints, e.g., x-\- 
y — z < 5 [3]. (2). The state reachability problem is undecidable for parameter- 
ized timed automata (with more than 2 parameterized clocks) [5]. 

On the other hand, binary reachability is the set of all configuration pairs 
(a,P) such that a P (we use ^^,4 in the paper). Characterizations of the 
binary reachability of timed automata have recently been established. 

Theorem 3. (1). The binary reachability of timed automata with real valued 
clocks is definable in the additive theory over reals and integers [7,9]. (2). The 
binary reachability of timed automata with integer valued clocks is definable by 
a Presburger formula [7,15]. 

Characterizations of binary reachability help us to reason about non-region 
properties of timed systems. For instance, consider the following property: “for 
every configuration a there exists a configuration (3 such that a [3 and 
the clock xi in [3 is the sum of clocks X\ and X 2 in a.” Though constraint 
]3xi = Oix., + 0^2 i® ’^ot in the form of a clock region, this property can be 
automatically verified for timed automata [7,9,15]. 

However, for generalized discrete timed automata, the binary reachability 
is too strong to have an interesting characterization. In particular, even the 
membership problem for binary reachability, i.e., deciding whether two given 
configurations a and ]3 satisfy a /? for a generalized discrete timed automa- 
ton A, is undecidable. This follows from the fact that a two-counter machine can 
be simulated by a generalized discrete timed automaton A, as shown in the the 
proofs of Theorem 2(1) (2) [3,5]. Thus, the membership problem can be reduced 
to the halting problem for two-counter machines, which is undecidable. 

Theorem 4. The membership problem for binary reachability is undecidable for 
generalized discrete timed automata. 
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This undecidability result for generalized discrete timed automata leads us 
to consider the following three approximations of Let r and B be fixed 
positive integers. The first approximation is r-reset-bounded reachability. A path 
is called r -reset-hounded if each clock resets at most r times. Write 
a (3 if a reaches /3 through an r-reset-bounded path. The second approxi- 
mation is B-bounded reachability. A path is called B-bounded if for 

each j < k, each Xi G X, |a^. — < B. Write a P if a reaches P 

through a B-bounded path. The third approximation is {B ,r) -crossing-hounded 
reachability. A path a'^a^ ■ ■ ■ is called (B, r) -crossing-bounded if there are at 
most r many t’s such that |a^. — > B. Write a P if a reaches P 

through a (B, r)-crossing-bounded path. 

The main results in this paper show that the three approximations of binary 
reachability '^,4 have decidable characterizations, i.e., they can be accepted by 
a class of machines with a decidable emptiness problem. Before we proceed to 
show the results, some further definitions are needed. 

A nondeterministic multicounter machine (NCM) is a nondeterministic ma- 
chine with a finite number of states, a one-way input tape, and a finite number 
of integer counters. Each counter can be incremented by 1, decremented by 1, 
or stay unchanged. These counter assignments are called standard assignments. 
In addition, a counter can be tested against an integer constant. These tests are 
called standard tests. An NCM is reversal-hounded if each counter is reversal- 
bounded (i.e., it changes mode between nondecreasing and nonincreasing for 
some bounded number of times). 

A tuple of integers can be encoded as a string by concatenating the unary 
representations of each integer with a separator. In this way, the binary reach- 
ability can be treated as a language {[a]#[/3] : a /?}, where [a] and 
[P] are string encodings of configurations a and P separated by a delimiter . 
Obviously, same encoding applies to '^m, the binary reachability of an NCM 
M. 

When an NCM is reversal-bounded, the emptiness problem is decidable. In 
fact, we have a stronger characterization. 

Theorem 5. A set of n-tuples of integers is definable by a Presburger formula 
iff it can he accepted by a reversal-bounded deterministic multicounter machine 
[19]. 

The machines defined above, when used as language recognizers, have a one- 
way input tape. Suppose a two-way input is used instead. Let 2DCM(c, r) denote 
the class of deterministic machines with a two-way input tape and c r-reversal- 
bounded counters. Then the emptiness problem for 2DCM(c, r) when c > 2 and 
r > 1 is undecidable [19]. An interesting special case is when c = 1, i.e., there 
is only one counter. A language is 2DCM-recognizahle if it can be accepted by a 
2DCM(l,r). 

Theorem 6. The emptiness problem for 2DCM-recognizahle languages is decid- 
able [20]. 
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It is still open whether Theorem 6 holds for nondeterministic machines. That 
is, whether the emptiness problem for 2NCM(l,r), which is a nondeterministic 
r-reversal-bounded one counter machine with a two-way input tape, is decidable. 

Given a generalized discrete timed automaton A, consider a subset of config- 
uration pairs, C S x x x S x x One can 

look at R_a as some sort of reachability relation. For a given A, if a 2DCM(1, r) 
M _4 can be effectively constructed such that for every w, w is in i ?_4 iff there ex- 
ists a w' such that M _4 accepts w^w', then we say that i ?_4 has a 2DCM-padding. 
From Theorem 6 , it is routine to show the following lemma. 

Lemma 1. (1). The emptiness problem for having 2DCM-paddings is 

decidable. (2). If R\ and R\ have 2DCM-paddings, then so do the join R\\JF(^ 
and the composition R\ o . 

3 Main Results 

Denote to be the binary reachability of a generalized discrete timed automa- 
ton A through a path without clock resets. Note that A itself can be considered as 
a nondeterministic multicounter machine. However, tests in A, which are linear 
relations on clocks and parameterized constants, are not standard. Assignments 
in A in the form oi x := x + d with d a parameterized constant are also not stan- 
dard. But, if A has only unit durations, the assignments are standard except 
when a clock reset occurs, i.e., x := 0. The following result follows our recent 
results on strong-reversal-bounded NCMs [21]. 

Lemma 2 . Suppose A is a generalized discrete timed automaton with unit du- 
rations. Then is Presburger. 

The following theorem gives a characterization of the three approximations 
of '^A when A has unit durations. The proof cuts a reachability path of A into 
a finite number of phases and each phase can be further characterized by 
using Lemma 2. 

Theorem 7. Suppose A is a generalized discrete timed automaton with unit 
durations. Then ^[ 4 , "^a Presburger. 

The case for A with parameterized durations is more complicated. In prin- 
ciple, A with parameterized durations can be simulated by an A' with unit 
durations. This is done by simply introducing a new clock z to test whether 
the parameterized duration of a transition is reached, and after the transition is 
fired z is reset to 0. However, the three approximations on A are not equivalent 
to those on A' . The reason is as follows. Consider '^[ 4 , the 0-reset-bounded ap- 
proximation of A. a '^[4 /? if a can reach (3 through a path without clock resets. 
But for A! each transition with a parameterized duration causes clock z to reset. 
Thus, a path in A' witnessing a '^[4 (3 could have an unbounded number of 
clock resets. Therefore, for A with parameterized durations. Theorem 7 is not 
applicable. 
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In the following, we consider a generalized discrete timed automaton A (with 
parameterized durations). Currently, we cannot show a decidable characteriza- 
tion of ^^ 4 , since we find it is related to the emptiness problem of 2NCM(l,r), 
which is still open. However, by restricting A to be deterministic, the follow- 
ing results can be established. The proof uses the fact that the disjunction of 
the atomic linear relations that appear in all the enabling conditions of A is 
equivalent to a union of convex linear relations. 

Lemma 3. has a 2DCM-padding for a deterministic generalized discrete 
timed automaton A. 

Again, by using Lemma 3 and the idea of cutting a reachability path of A 
into phases, the following theorem gives a characterization of the three approx- 
imations of '^A ■ 

Theorem 8. Suppose A is a deterministic discrete timed automaton. Then, 
'^A have 2DCM-paddings. 

4 Verification of Safety Properties 

Consider P and /, two sets of configurations of a generalized discrete timed 
automaton A definable by Presburger formulas. If, starting from a configuration 
in I, A can only reach configurations in P, then P is a safety property with 
respect to the initial condition I . The following is an example: 

“starting from a configuration satisfying x\ — X 2 + x^ — Xj^ > c + d, A cannot 
reach a configuration satisfying 2xi + X 2 < cA X 3 — 8 x 4 > 2d — c.” 

The safety analysis problem is to determine whether P is a safety property 
with respect to the initial condition I. The following two theorems follow from 
Theorem 7, Theorem 8, and Lemma 1. 

Theorem 9. The safety analysis problem is decidable for generalized discrete 
timed automata with unit durations and with any one of the following approxi- 
mations: r -reset-boundedness, B -boundedness and {B ,r) -crossing-boundedness. 



Theorem 10. The safety analysis problem is decidable for deterministic gen- 
eralized discrete timed automata with any one of the following approximations: 
r-reset- boundedness, B -boundedness and {B,r) -crossing-boundedness. 

Remark: Theorem 10 can be strengthened. The class of languages accepted by 
deterministic two-way counter machines with one reversal-bounded counter is 
closed under Boolean operations [20]. It follows that Theorem 10 remains valid 
even if the sets of configurations P (property) and / (initial condition) are sets 
accepted by these machines. I 

It is desirable to consider the decidability of the safety analysis problem 
for generalized discrete timed automata under some special form but without 
using the approximations. One such form is parameterized timed automata with 
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only one parameterized clock. As stated in Theorem 1 (2), the state reachability 
problem is decidable. However, surprisingly, the safety analysis problem, i.e., 

“Deciding whether P is a safety property with respect to the initial condition 
I for a parameterized timed automaton with only one parameterized clock, where 
both P and I are definable by Presburger formulas” 

is still open. This problem is closely related to the open problem of the decidabil- 
ity of the emptiness problem for 2NCM(l,r). It is also worthwhile to point out 
that the characterizations of the binary reachability of deterministic generalized 
discrete timed automata under the approximations, as shown in Theorem 8, are 
not necessarily Presburger. In fact, 2DCM(I,r) can accept a class of nonlinear 
languages [20]. 

5 A Verification Example 

In practice, allowing parameterized clock constraints and durations makes it 
possible to specify more complex real-time systems. In this section, we take 
an example specification [23] of the railroad crossing benchmark [18], which is 
written in ASTRAL [6]. The specification specifies a system consisting of a set 
of railroad tracks that intersect a street where cars may cross the tracks. A gate 
is located at the crossing to prevent cars from crossing the tracks when a train 
is near. A sensor on each track detects the arrival of trains on that track. The 
critical requirement of the system is that whenever a train is in the crossing the 
gate must be down, and when no train has been in between the sensors and the 
crossing for a reasonable amount of time, the gate must be up. The complete 
ASTRAL specification of the railroad crossing system, which was written by 
Paul Kolano, can be found at http://www.cs.ucsb.edu/~dang. 

The ASTRAL specification includes a global specification and two process 
specifications: one is Gate and the other is Sensor. A transition system is spec- 
ified inside each process specification along with local assumptions and local 
properties. ASTRAL adopts a modularized view of the specified system at the 
level of correctness proofs: the global properties in the global specification can 
be verified by using global assumptions and local properties (instead of the ac- 
tual transition behaviors) of each process instance; local properties of a process 
instance can be verified by using the local assumptions, the local imported vari- 
able clause and the transition system of the instance, without looking at the 
transition behaviors of the other process instances (A reader need not worry 
about the possibility of circular proofs. The ASTRAL proof theory is sound, see 
[8] for details.). 

We take the Sensor process specification to see how parameterized dura- 
tions and parameterized clock constraints are used in ASTRAL. Sensor reports 
whether a train is beyond the crossing, which is indicated by a Boolean vari- 
able train_in_R. The process specification has only two transitions enterJl and 
exit_R, which have parameterized durations enter_dur and exit_dur, respec- 
tively. Transition enter _R changes train_in_R from FALSE to TRUE with duration 
enter_dur. Transition exit_R sets train_in_R back to FALSE after the slowest 
train moves out of the crossing. That is. 
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TRANSITION exit_I 

ENTRY [ TIME : exit_dur ] 

train_in_R 

& now - Start ( enter_R ) >= Rllmin - exit_dur 

EXIT 

train_in_R = FALSE, 

where now indicates the current time, Start (enter_R) is the most recent start 
time of transition enter Jl, and Rllmin is a parameterized constant indicating 
the time for the slowest train to move out of the crossing. One of the two transi- 
tions is nondeterministically chosen to fire as long as the ENTRY condition of the 
chosen transition is satisfied. However, if neither of them is fireable, the process 
idles: now progresses by one time unit and train_in_R does not change. 

But, according to this semantics, transition enter_R must fire immediately 
whenever transition exit_R completes. This is not the intended behavior of 
Sensor. In fact, it is desirable that enterJl fires only when a train actually 
comes. But enter_R specified as above only tells what happened (set train_in_R 
to true) when a train comes, instead of the time when a train comes. The pattern 
of a train’s arrival is controlled by the environment of the process. In this spec- 
ification, transition enter_R is declared as exported. That is, it must be called 
by the environment in order to fire. In Sensor, the environment is specified by 
the environment clause, which states that two consecutive calls of enter_R must 
be separated by at least Rllmin many time units. The safety property of the 
process is specified as a schedule, which has two conjuncts. The first conjunct of 
the schedule says that a train will be sensed within enter_dur many time units 
after a call of transition enter_R is placed. The second conjunct of the schedule 
says that the sensor will be reset when the slowest train is beyond the crossing. 
It is assumed that initially now is 0 and train_in_R is FALSE. 

We manually translated Sensor into a generalized discrete timed automaton 
and computed the transitive closure of the one-step transition of the automaton 
using the Omega Library [26], which is a tool to manipulate Presburger formulas. 
Experiments were run on a Sun workstation with 4 CPUs and 256M real memory 
and 512M swap memory. Unfortunately, the closure, even when the durations 
(enter_dur and exit_ dur) were set to 1, could not be computed. Then, we 
used the H-bounded approximation on the automaton with B = 2>. This time, 
the binary reachability can be computed in about one minute of CPU time 
and using 170M memory. But the other two approximation approaches were still 
too expensive to calculate. 

6 Conclusions and Future Work 

We studied generalized discrete timed automata with general linear relations 
over clocks and parameterized constants as clock constraints and with parame- 
terized durations. We focused on three approximation techniques and automata- 
theoretic characterizations of binary reachability under these approximations. 
The characterizations allow us to show that the safety analysis problem is de- 
cidable with respect to generalized discrete timed automata with unit durations. 
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and deterministic generalized discrete timed automata with parameterized du- 
rations (modulo the approximations) . We used an example specification written 
in ASTRAL to run a number of experiments using one of the approximation 
techniques. The results of the experiments show that further improvements to 
the approximations have to be developed, since currently they are not practical 
for large specifications. 

For future work, we want to investigate how the approximation techniques 
proposed in this paper can be combined with existing image-approximation tech- 
niques [12] in debugging infinite state systems. Solutions to this problem would 
lead to an implementation of an effective specification debugger for large real- 
time specifications. Another research issue is how to extend the results in this 
paper to the case of generalized timed automata with dense clocks. Recent ideas 
used in [9] may provide some insights. 
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Abstract. In contrast to the adoption of linear additive query updating 
techniques in existing popular algorithms for user preference retrieval, in 
this paper we design two types of algorithms, the multiplicative adaptive 
query expansion algorithm MA and the multiplicative adaptive gradient 
search algorithm MG, both of which use multiplicative query expansion 
strategies to adaptively improve the query vector. We prove that algo- 
rithm MA has a substantially better mistake bound than the Rocchio’s 
and the Perceptron algorithms in learning a user preference relation de- 
termined by a linear classifier with a small number of non-zero coefficients 
over the real-valued vector space [0, 1]". We also show that algorithm MG 
boosts the usefulness of an index term exponentially, while the gradient 
descent procedure does so linearly. Our work also generalize the algo- 
rithm Winnow in the following aspects; various updating functions may 
be used; multiplicative updating for a weight is dependent on the value 
of the corresponding index term, which is more realistic and applicable 
to real-valued vector space; and finally, a number of documents which 
may or may not be counterexamples to the algorithm’s current classihca- 
tion are allowed. Practical implementations of algorithms MA and MG 
have been underway in the next stage development of our intelligent web 
search tools. 



1 Vector Space and User Preference 

Let TZ be the set of all real values, and let 7^^ be the set of all non-negative 
real values. Let n be a positive integer. In the vector space model in informa- 
tion retrieval [11], a collection of n indexing terms Ti,T 2 ,...,T„ are used to 
represent documents and queries. Each document d is represented as a vector 
d = {di, . . . ,dn) such that for any t, 1 < i < n, the i-th. component of d is 
used to determine the relevance (or weight) of the i-th term in the document. 
Because a document vector can be normalized, without loss of generality we only 
consider the real-valued vector space [0, 1]” in this paper. Given any two vectors 
X = {x\, . . . ,Xn),y = {yi, • ■ • , yn) G [0, 1]” (or 7^"), we use x • y to denote their 
inner product xiyi + ■ ■ ■ + Xny-n- 

Let 77 be a collection of documents. Given any two documents in 77, we 
assume that a user would prefer one to other or regard both as being equivalent 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 540-549, 2001. 
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with respect to her information needs (or search queries). In other words, user 
preference of documents in T> defines a preference relation ^ over T> as follows: 

Vd, d' G 2?, d ^ d' 4=^ the user prefers d' to d. 

It has been shown in [I] that if a user preference relation ^ is a weak order 
satisfying some additional conditions then it can be represented by a linear 
classifier. That is, there is a query vector q = (gi, . . . , g„) G 7?.” such that 

Vd,d'Gl7, d -< d' 4=^ q*d<q*d'. (1) 

In general a linear classifier over the vector space [0, 1]" is a pair of (q, 9) which 
classifies any document d as relevant if q • d > 0, or irrelevant otherwise, where 
the query vector q G 7?." and the classification threshold 9 G TZ^. Recall that 
q • d is usually used as the relevance rank or (score) of the document d with 
respect to user preference. 

A natural way to understand a user preference relation ^ is document rank- 
ing: A user prefers a document d to a document d', if and only if she ranks 
d higher than d'. When a user has no preference of d to d' nor d' to d, then 
she is really not interested in how those two documents are actually ranked. 
Based on such understanding, the following linear acceptable ranking strategy 
was proposed in [13]: 

Vd, d' G 27, d -< d' q*d<q*d', (2) 

where q G 2?" is the query vector determined by the user. 

Let T>r be the set of all relevant documents in 27 with respect to a user’s in- 
formation needs (or search queries), and 27j^ the set of all irrelevant documents. 
If we assume that a user preference relation has a simple structure with only 
two levels, i.e., one level consisting of all relevant documents and the other con- 
sisting of all irrelevant documents. Within the same level, no preference is made 
between any two documents. Then, finding a user preference relation satisfying 
the expression (1) is equivalent to the problem of finding a linear classifier (q, 9) 
over [0,1]" with the property 

Vd G 27, d G 7^ 4=^ q • d > 0, (3) 

where q G 7?." is the query (or weight) vector. Similarly, finding a linear accept- 
able ranking strategy satisfying expression (2) is equivalent to the problem of 
finding a query vector q G 2?" with the property 

Vd, d' G 27, d G 27j^ and d' G 27^. q • d < q • d'. (4) 

The goal of relevance feedback in information retrieval is to identify a user 
preference relation ^ with respect to her information needs from documents 
judged by that user. Since user preference relations vary from different users and 
may have various unknown representations, it is not easy for an information sys- 
tem to find such relations. The existing popular relevance relevance algorithms 
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basically use linear additive query expansion methods to find a user preference 
relation as follows: 

- Start with an initial query vector qg. 

- At any step k > 0, improve the fc-th query vector to 

Qfc+i = Qfe + cTidi + • • • + Ogds, (5) 

where di, . . . , are the documents judged by user, and the updating 
factors ai € TZ for i = 1, . . . , s. 

One particular and well-known example of relevance feedback is Rocchio’s 
similarity-based relevance feedback [7]. Depending on how updating factors are 
used in improving the /c-th query vector as in expression (5), a variety of rele- 
vance feedback algorithms have been designed [11]. A similarity-based relevance 
feedback algorithm is essentially an adaptive supervised learning algorithm from 
examples [12]. The goal of the algorithm is to learn some unknown classifier 
(such as the linear classifier in expression (3)) that is determined by a user’s 
information needs to classify documents as relevant or irrelevant. The learning 
is performed by modifying or updating the query vector that serves as the hypo- 
thetical representation of the collection of all relevant documents. The technique 
for updating the query vector is linear additions of the vectors of documents 
judged by the user. This type of linear additive query updating technique is 
similar to what used by the Perceptron algorithm [10]. The linear additive query 
updating technique has some disadvantage: It’s converging rate to the unknown 
target classifier is slow, because it has been proved that the Perceptron algo- 
rithm [8] and the Rocchio’s relevance feedback algorithm [5] with any of the four 
typical similar measures [11] have an I7(n) lower bound on their performance 
of learning in the n dimensional binary vector space. In the real world of web 
search, a huge number of terms (usually, keywords) are used to index web docu- 
ments. To make the things even worse, no users will have the patience to try, say, 
more than 10 iterations of relevance feedback in order to gain some significant 
search precision increase. This implies that the traditional linear additive query 
updating method may be too slow to be applicable to web search, and this mo- 
tivates us to design new and faster query updating methods for user preference 
retrieval in section 4. 

For a user preference with respect to her information needs, for any index 
term T, define 

- 1 27 1 = the total number of documents in the collection 27 

- [27^1 = the total number of relevant documents 

- rj = the number of documents indexed by T 

- 7 = the number of relevant documents indexed by T 

A gradient descent procedure has been designed in [13] to find an acceptable 
ranking strategy satisfying expression (2). The idea of the procedure is to mini- 
mize ranking errors through linear additions of d' — d for all pairs of documents 
d' and d that, according to expression (3), are ranked incorrectly. This strat- 
egy is similar to what used by the Perceptron algorithm [6]. When the gradient 
descent procedure is applied to find an acceptable ranking satisfying expression 
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(4), it has been shown [13] that after the first iteration from a zero initial query 
vector the procedure weighs an index term T linearly in , an approxi- 

mate measure of the usefulness of index term T for distinguishing relevant and 
irrelevant documents. It has also shown that under certain good index term prob- 
ability distribution the above usefulness measure for the index term T reaches its 
expected maximum when rj = 0.5|2?|, a justification for choosing mid-frequency 
terms in indexing [11]. In contrast to the linear additive query updating strat- 
egy used in the gradient descent procedure [13] for minimizing ranking errors, in 
section 4 we will design a new algorithm with the multiplicative query updating 
strategy. We will show that after the first iteration from a zero initial query 
vector our new algorithm weighs an index term T exponentially in — y^. 
This means that exponentially large gaps will be generated for index terms with 
respect to measures of their usefulness. Hence, a document with a good index 
term will be ranked exponentially higher than one without the good index term, 
thus more ideal ranking effect will be generated for users. 

2 The Multiplicative Adaptive 
Query Expansion Algorithm 

In this section we design a multiplicative query updating technique to identify a 
user preference relation satisfying expression (1). We believe that linear additive 
query updating yields some mild improvement on the hypothetical query vector 
towards the target user preference. We want a query updating technique that 
can yield dramatic improvements so that the hypothetical query vector can be 
moved towards the target in a much faster pace. Our idea is that when an index 
term is judged by the user, its corresponding value in the hypothetical query 
vector should be boosted by a multiplicative factor that is dependent on the 
value of the term itself. 

Algorithm MA{qo, f, 9): 

(i) Inputs: 

qo, the non-negative initial query vectory 
f(x) ; [0, 1] — > , the updating function 

9 > 0, the classification threshold 
(a) Set k — 0. 

(Hi) Classify and rank documents with the linear classifier (qfc,0). 

(iv) While (the user judged the relevance of a document A) do { 
for i = 1, . . . , n, do { 

/* Qfc = d = (di, . . . ,d„) V 

if (d. / 0) { 

if (qfc A 0) set qi,k+i = qi,k else set qi,k+i = 1 /* 

adjustment */ 

if (d is relevant) /* promotion */ 
set Q'i.fc+i ~ (1 “f f idi)')qi^k+i 
else /* demotion */ 
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qi,k+i = 

} 

} 

} 

(v) If the user has not judged any document in the k-th step, then stop. Oth- 
erwise, let k = k 1 and go to step (iv). 

/* The end of the algorithm MA */ 

In this paper, we are only interested in non-decreasing updating functions f(x) : 
[0,1] ^ R+, because we want multiplicative updating for an index term is 
proportional to the value of the term. 

We now analyze the the performance of algorithm MA when it is used to iden- 
tify a user preference satisfying expression (3), a linear classifier (q, 0). Here, we 
consider a zero threshold, we say that algorithm makes an classification error 
at step k if the user judged a document as a counterexample to the algorithm’s 
current hypothesis. We estimate the total number of classification errors algo- 
rithm MA will make based on the worst-case analysis. We also let at most one 
counterexample may be provided to the algorithm at each step. From now on to 
the end of this section we assume that q is a non-negative query vector with m 
non-zero components and 9 > Q. Define 

!3 = min{qi,...,qn}. 

Definition 1. We say that documents in the collection T> are indexing with 
respect to a threshold 5, 0 < d < 1, if for any document d = [di, . . . , d„) € T) we 
have either di = 0 or 6 < di, 1 < i < n. 

In other words, when a document is indexed with respect to a threshold S, 
any index term with a value below the threshold S is considered not significant, 
and hence set to zero. Recall that in the vector space a document and its vector 
have the equivalent meaning, so we may not distinguish the two concepts. 

Lemma 1. Assume that documents are indexed with respect to a threshold S. 
Let u denote the total number of promotions algorithm MA needs to find the 
linear classifier (q, 0). Let m denote the number of non- zero components in q. 
Then, 

/ wlog|5 
log(l + /(5))- 

Proof. Without loss of generality, we may further assume that the m non-zero 
components of q are qi, . . . , qm- When a promotion occurs at step k, a relevant 
document d is given to algorithm as a counterexample its classification. Because 
of document indexing with respect to the threshold 5, there is some i with 
1 < i < m such that di > 5. This means that the i-th component qi^k of the 
query vector q^, will be promoted to 



( 7 i,fe+i = (1 -I- f{di))qi^k > (1 + f{S))qi,k, 



(6) 
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because / is non-decreasing. Since qi^k will never be demoted, it follows from 
expression (6) that qi^k can be promoted at most 



log Is 

iog(i + m) 



( 7 ) 



times. Since each promotion yields a promotion for at least one qi^k for 1 < i < m, 
the total number of promotions u is at most m times the value given in expression 

( 7 ). 



Theorem 1. Assume that documents are indexed with respect to a threshold S. 
Let T denote the total number of classification errors algorithm MA makes in 
order to find the linear classifier (q, 0) over the real-valued vector space [0, 1]". 
Let m denote the number of non-zero components in q. Then, 

rj, < [(1 + /(l))(rt -m) + 7](1 + f{5))5 f{l){l + f{6)) ”^fogp 

f{S)0 ^ f{S) ^ + 



(Hence, if 0 = ^ is chosen, T = O(klogn).) 



Proof. Without loss of generality, we may assume that the m non-zero compo- 
nents of q are qi,. . . ,qm- We estimate the sum of the weights Yl'i=iT,k- Let 
u and V be the number of promotion steps and the number of demotion steps 
occurred during the learning process, respectively. Let tk denote the number of 
zero components in q^ at step k. Note that once a component of q^ is promoted 
to a non-zero value, it will never become zero again. For a promotion at step k 
with respect to a relevant document d judged by the user, for i = 1, . . . , n, we 
have 

if di 0, 

(1 + f{di)), if dij^O and Qi^k = 0, 

(1 f di ^ 0 and Qi^k ^ 0- 

Since a promotion only occurs when 

n 

Qfc • d — ^ ^ diQi^k — ^ ^ Qi.,k ^ ^5 

di^O and 



we have 



Z=1 



^9i,fc+l — ^ 9i,fc+l + ^ Qi,k-i-l + ^ di,k+l 

di^O and qi ^—O di^O and di—0 

(l + /(c^i))+ ^ (1 -I- f{dj))qi^k + qi,k 

di^O and qi ^—O and qij^z^O 

i + /(i) 



Xi—0 



< (i + /(i))tfe 



%,fc +y|g»,fc- 

and q^ i^z^O 






s 
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< (l + /(l))tfe 



1 + /( 1 ) 



dtqi,k + 



di^O and 



^ (l + /(l))^feH ^ d + qi,k- 



(8) 



i=l 



For a demotion at step k with respect to an irrelevant document d judged by 
the user, for z = 1, . . . , n, we have 

qi , k+l — qi,k (1 , ~ FTT \^ di,k ^ 9 i,fc (1 , TFx \') di , k - 

Since a demotion occurs only when X^r=i diqi,k > d, we have 

1 



^ ' qi , k+l ^ ^ ' qi,k (1 



i=l 



i=l 



1 + /W'g 



Qi,k 






m 



i=l 



a + m)Si 






2 = 1 



< ~ 



m 



2 = 1 



a + m)s 






< X! - 



/(^) 



2 = 1 



(l + /(<5))^ 



2 = 1 



0 



(9) 



Let the sum of the initial weights be a. Hence, after u promotions and v demo- 
tions, 



J2d^,k+l < (1 + /(!)) XI + XI qifi 



fim 



f{6)0v 

a + m)s 



< (1 -I- /(l))(n -m) + a + 



f{l)9u f{S)9v 



6 {i + fms 

Note that at any step the weights are never negative. It follows from the above 
relation that 

[(1 + f{^)){n - m) + a]{l + f{6))S f{l){l + f{S))u 

me m 

It follows from Lemma 3.2 and the above analysis that the total number of 
promotions and demotions, i.e., the total number of classification errors T, is 
bounded by 



V < 



(10) 



T < V + u 

[(1 + /(l))(n -m) + g](l -h f{S))S f{l){l + f{S))u 

f{6)9 ^ m 

^ [(1 + fmjn -m) + g](l -h f{S))S f{l){l + f{S)) mlog 

f{S)e m ^ hog{i + m) 

This completes our proof. 




Multiplicative Adaptive Algorithms for User Preference Retrieval 547 



3 The Multiplicative Gradient Search Algorithm 

In this section, we design algorithm MG for finding a query vector q satisfying the 
acceptable ranking strategy condition (2). Algorithm MG uses a multiplicative 
query updating technique to minimize its ranking errors. 

Algorithm MG{qo, f): 

(i) Inputs: 

qo, the non-negative initial query rectory 
f{x) : [0, 1] — > , the update function 

(a) Set k = 0. 

(Hi) Let qfc be the query vector at step k. Identify the set of mistakes 
G(qfc) = {< d, d' > |d ^ d' , qj, • d > q^ • d'}. 

U r{qk) = 0, stop. 

(iv) For each pair < d, d' >G U(qfe), do { 

for i = 1, . . . , n, do \ 

if (d' / 0) { 

if / 0) set qi,k+i = qi,k else set qi,k+i = 1 /* 

adjustment */ 

set qi,k+i = (1 + f{d'i))qi,k+i /* promotion */ 

} 

tf (d, / 0) 

set qi,k+i = i+f(d ) /* demotion */ 

} 

} 

(v) Let fc = fc + 1 and go to step (iv). 

/* The end of the algorithm MG */ 



Theorem 2. Assume that algorithm MG is applied to find an acceptable ranking 
strategy satisfying condition (4). If one chooses the initial query vector qo = 0, 
then after the first iteration, for any 1 < i < n, the weight qi^i for the i-th 
indexing term in qi is 



9 1 = n n 

d^Vir d'^Vr 



1 + fjd'fi 

1 + f{di) ■ 



In particular, when a linear updating function f{x) = a is chosen. 

Proof. Since the acceptable ranking strategy satisfying condition (4), it follows 
from qo = 0 that at the first iteration one obtains 



'T(qo) = {< d,d' > |d G T>ir,d' G T>r}. 

Note that during the first iteration, for each pair < d, d' >G G(qo), a promotion 
performed for d' and a demotion is performed for d. This implies that for any 
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i, ^ ^ i ^ n, after the first iteration the value qi^i with respect to the i-th index 
term is 



%i= n n 

d^G'Z^T’ 



1 + /(^') 

1 + f{di) ■ 



When / is a constant function mapping with f{x) = a, it easily follows from 
the above expression 



^ TT n i±/K) = n (1 + /K)) 

YldGT>ir- Ud'^vS^+im 

_ rid'gPr ridgPir (1 + /K)) Od 'gPr (i + /K))i^--i 

(1 + m)) 

ridGX)ir (i + M))i^-i 

" ndgi,..(l + «)'^’-' " + 

= (1 + = (1 _|_ Q,)(l'^|-|'^>'l)7-|'Dr|(»)-7) 



This completes our proof. 



4 Concluding Remarks 

Our motivations of the work in this paper come from the reality of web search: 
Web search users usually have no patience to try, say, more than 10 iterations 
of relevance feedback for some intelligent search system in order to gain certain 
significant search precision increase. Existing popular algorithms for user prefer- 
ence retrieval have their own beauty and advantages. Because of the adoption of 
linear additive query updating techniques, when used to identify user preference 
determined by a linear classifier those algorithms, such as Rocchio’s algorithm 
and its variants, the Perceptron algorithm and gradient descent procedure, have 
either a slow converging rate or small boosting on the usefulness of an index 
term. 

In contrast to the adoption of linear additive query updating techniques in 
the those existing algorithms, we design two types of algorithms, the multiplica- 
tive adaptive query expansion algorithm MA and the multiplicative adaptive 
gradient search algorithm MG, both of which use multiplicative query updat- 
ing techniques to adaptively improve the query vector. We have analyzed the 
worst-case performance of algorithms MA and MG. We plan to investigate their 
average-case performance. We feel that this task is very challenging. We also 
plan to conduct empirical studies to understand the behaviors of the algorithm 
MA and GA with real world data sets. Finally, we would like to point out that 
implementations of algorithms MA and MG have been underway in the next 
stage development of our intelligent web search tools [2,3,4]. 
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Abstract. The problem of parametric scheduling is concerned with check- 
ing whether a job-set is parametrically schedulable, subject to a set of 
imposed constraints. In real-time scheduling, parameterization of the 
schedule plays an important role in extending the flexibility of the sched- 
uler, particularly in the presence of variable execution times. It has been 
shown that the existence of parametric schedules can be determined in 
polynomial time when the constraints are restricted to those that can be 
represented by a network, unimodular matrix. In this paper, we extend 
the class of constraints for which parametric schedules can be deter- 
mined efficiently to include network constraints, such as weighted sum of 
completion times. 



1 Introduction 

Uncertainty in problem parameters is often a feature of real-time scheduling 
[16]. In the literature, there exist two broad approaches to address uncertainty, 
viz. stochastic and deterministic. In stochastic scheduling, the goal is to pro- 
vide probabilistic guarantees that the constraints imposed on the job-set will 
be met, under the assumption that the non-constant problem paramters belong 
to a fixed well-understood distribution [11,4,14]. This approach is not applica- 
ble in the case of “hard” real-time systems [16,12,9], where the guarantees have 
to be absolute i.e. there is no room for error. Hard real-time systems are typ- 
ically composed of mission-critical tasks, wherein the consequences of failure, 
i.e. a violation of constraints can be catastrophic. Hard real-time systems call 
for deterministic approaches to the issue of uncertainity. Some of the common 
approaches include worst-case assumptions [11], static scheduling [19] and para- 
metric scheduling [5]. Worst-case assumptions regarding execution times (say) 
run the risk of constraint violation at run-time [17] and hence the strategy is 
not always correct. Static approaches make extremely conservative assumptions 
about each constraint in order to determine the existence of a feasible schedule. 
This approach is correct inasmuch as the goal is to provide a set of start-times 
that cannot cause constraint violation. However, static scheduling is extremely 
inflexible and even simple constraint sets (see Section §3) may not have static 
schedules. Parametric scheduling attempts to combine the guarantees provided 
by static scheduling with the flexibility of stochastic scheduling to enhance the 
class of constraints for which feasible schedules exist. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 550-560, 2001. 
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In real-time scheduling literature, it has been shown that polynomial time 
algorithms exist for determining parametric schedules, when the constraints are 
restricted to be “standard” [5,2]. In this paper we are concerned with the fol- 
lowing question: Can parametric schedulability he determined efficiently for non- 
standard constraints, with at most 2 tasks per constraint ? We provide an affir- 
mative answer to the above question by designing a polynomial time algorithm 
for network constraints. 

The rest of this paper is organized as follows: Section §2 provides a formal 
description of the problem under consideration. Section §3 discusses the moti- 
vation underlying our research while related approaches are detailed in Section 
§4. Our algorithm and the accompanying analysis are presented in Section §5. 

2 Statement of Problem 

2.1 Job Model 

Assume an infinite time-axis divided into windows of length L, starting at time 
t = 0. These windows are called periods or scheduling windows. There is a set of 
non-preemptive, ordered jobs, J = { Ji, J2 , . . . , J„} that execute in each schedul- 
ing window. 

2.2 Constraint Model 

The constraints on the jobs are described by System (1): 

A.[s, e] < 6, e G E, (1) 



where, 

~ A is an m X 2.n rational matrix, in which every row represents a network 

constraint (to be defined below), 

E is the convex set defined by the axis-parallel hyper-rectangle (aph) 

T = [li,Ui] X [l2,U2] X . . .[ln,Un] (2) 

— s = [si, S 2 , . . . , s„] is the start time vector of the jobs, and 

— e = [d, 62 , . . . , e„] G E is the execution time vector of the jobs 

The characterization of E as an aph is intended to model the fact that the 
execution time of job Ji is not a fixed constant, but can assume any value 
in the pre-specified range [li,Ui], depending on factors such as loop-length. In 
real-time systems such as Maruti, a number of runs of the job-set are carried 
out to statistically estimate the range [li,Ui] [10]. 

Observe that System (1) can be rewritten in the form 



G.s -I- H.e <b, e G E 



( 3 ) 
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Definition 1 . Standard Constraint: A constraint is defined to he standard, if 
represents a relative separation relationship between at most two tasks, i.e. ma- 
trices G and H are flow graph, unimodular, with the added provision, that the 
entry H[i,j] must equal G[i,j] if it is non-zero. 

For instance a constraint of the form: Si — Sj < —8, which specifies that Job 
Jj must start 8 units after job Ji starts is a standard constraint. A detailed 
description of standard constraints is available in [5]. Standard constraints can 
be represented by edges of a fiow graph [18]. 

Definition 2 . Network Constraint: A constraint is said to be a network con- 
straint, if it can he expressed in the form 

a. Si + b.Sj < c.Ci + d.Cj, a,b,c,d€ Q (4) 

i.e. all co-efficients are arbitrary rational numbers. 

Network constraints also have a graph structure; the “edge” representing the 
set of constraints between two jobs Ji and Jj form a polyhedron in the 4 variables 
Si,ei,Sj,ej, with and ej being universally quantified [6]. 

2.3 Query Model 

Suppose that job has to be dispatched. We assume that the dispatcher has 
access to the start times {si, S 2 , ■ . ■ , Sa-i} and execution times {ei, 62 , ... , Ca-i} 
of the jobs { Ji, J 2 , ■ • • , Ja-i}- 

Definition 3. A parametric schedule of an ordered set of jobs, in a scheduling 
window, is a vector s = [si, S 2 , . . . , Sn], where si is a rational number and each 
Si,i 1 is a function of the start time and execution time variables of jobs 
sequenced prior to job Ji, i.e. {si, ei, S 2 , 62 , ... , Si_i, ei_i}. Further, this vector 
should satisfy the constraint system (1) for all execution time vectors e S T. 

The combination of the Job model, Constraint model and the Query model 
constitutes a scheduling problem specification within the E-T-C scheduling frame- 
work [17]. 

The Parametric Scheduling problem is concerned with the following two is- 
sues: 

1. Determining whether the given job-set has a parametric schedule, i.e. a 
schedule as defined in Definition (3); 

2. Computing the start-time of a job in each scheduling window, assuming that 

(a) The parametric schedulability query has been decided affirmatively, and 

(b) The start and execution times of all jobs sequenced before it, are pro- 
vided. 

This corresponds to the online dispatching phase. 

The discussion above directs us to the following formulation of the parametric 
schedulability query: 

3si Vei G [;i,Mi] 3s2 Vc 2 G [/ 2 -W 2 ], • • -3s„ Ve„ G [;„,M„] A.[s,e ]<6 ? (5) 
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3 Motivation 

The motivation for Parametric Scheduling has been provided in great detail in 
[5] and [17]. The key issue is that a simple constraint system such as: 

Si + Cl < S2 

S2 ^ Si + Cl + 1 

ei G [3, 5] 



does not have a static schedule i.e. no assignment of rational numbers to the 
start times can guarantee the meeting of both constraints for all values of e\ in 
the range [3, 5]. Note that the schedule represented by s = [0, si + ei]^ is a valid, 
feasible, albeit parametric schedule. 

An interesting line of research is obtaining schedules that satisfy certain 
optimization criteria. The complexity of Parametric Optimization for general 
constraints is not known. However, we can approximate optimization functions 
involving at most two tasks through the use of network constraints. Optimization 
criteria, formulated as performance metrics arise in various situations including 
Job-shop, Flow-shop and Machine-Shop [11]. Typical performance metrics are 
Makespan, Sum of Completion times. Weighted sum of completion times (also 
called aggregate metrics in the Operations Research literature). Lateness and 
Tardiness. For instance, the need to minimize the weighted sum of completion 
times of jobs J\ and J 2 can be approximated through Wi.(si-|-ei)-|-ru 2 -(s 2 + e 2 ) < 
k, for suitably chosen k [16]. The other performance metrics can be similarly 
approximated. 

4 Related Work 

The concept of Parametric Scheduling was introduced in [12]. In [5], polynomial 
time algorithms were presented for the case, when the constraints imposed on 
the job-set are “standard”. [18] argued that “standard” constraints could be 
represented by a flow, graph (i.e. the matrix A is network, unimodular). [2] 
extends the “standard constraint” model to include the case, in which constraints 
can exist between adjacent windows. In [17], it is shown that the problem can 
be solved in PSPACE, when the constraints are arbitrary. However, no hardness 
result is known for this problem [7,1]. 

A curious feature of the algorithms proposed in the literature for Parametric 
Schedulability is that none of them exploit the ordering information in the job- 
set. In both [5] and [2] polynomial time bounds are derived by observing that 
the number of relative constraints between any two tasks is bounded, if only 
strict relative constraints are permitted. In this paper, we explicitly take into 
account the ordering information to develop two new concepts viz. Constraint 
Domination and Constraint Orientation, which in turn are used to develop poly- 
nomial time algorithms for testing parametric schedulability in arbitrary network 
constraints. 
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Linear programs with at most 2 variables per constraint (LI(2)s) have re- 
ceived quite a bit of attention in the Operations Research community. [15] was 
the first to observe the correspondence between LI(2)s and graphs; in [6], the 
Fourier-Motzkin elimination procedure was used to provide a strongly polynomial 
algorithm which to date is the fastest known algorithm for this problem. 

5 Algorithms and Complexity 

Algorithm (5.1) presents our strategy for testing parametric schedulability. 



Function PARAMETRIC-SCHEDULER (T, A, b) 

1 : for (i — n down to 2 ) do 
2: ELIM-UNIV-VARIABLE(e,) 

3: if (CHECK-lNCONSISTENCY())then 

4: return ( false ) 

5: end if 

6: Prune-Constraints() 

7: ELIM-EXIST-VARIABLE(Si) 

8: if (CHECK-lNCONSISTENCY())then 

9: return ( false ) 

10: end if 

1 1 : end for 

12: Elim-Univ-Variable (ei) 

13: if ( o < Si < 6, a,b > 0) then 

14: Valid Parametric Schedule Exists 

15: return 

16: else 

17: No Parametric Schedule Exists 

18: return 

19: end if 



Algorithm 5.1: A Quantifier Elimination Algorithm for determining Parametric 
Schedulability 



Algorithm (5.2) describes the procedure for eliminating the universally quan- 
tified execution variable € [li,Ui\. The Fourier-Motzkin elimination technique 
discussed in [20] represents one implementation of Elim-Exist- Variable. In 
general, any polyhedral projection method suffices. In our work, we assume that 
the Fourier-Motzkin procedure is used to eliminate the existentially quantified 
(start-time) variables ^ . When a variable (start-time or execution time) is elim- 
inated, inconsistencies and/or redundancies could result. 

Check-Inconsistency() identifies inconsistencies and declares the system to 
be infeasible, while Prune-Constraints() identifies redundancies and elimi- 
nates them. 

^ A detailed exposition of the Fourier-Motzkin elimination procedure is available in 

[ 20 ]. 
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Function Elim-Univ- Variable (A, b) 

1: Substitute e* = U in each constraint that can be written in the form e* > () 
2: Substitute e* = Ui in each constraint that can be written in the form e* < () 



Algorithm 5.2: Eliminating Universally Quantified variable Ci £ [U,Ui] 

The correctness of Algorithm (5.2) has been argued in [5], while the correct- 
ness of the Fourier-Motzkin procedure is discussed in [13]. 

We point out that Algorithm (5.1) is similar to the ones outlined in the litera- 
ture. The difference is the implementation of procedures Prune-Constraints() 
and the analysis provided in §5.1. 

5.1 Analysis 

Observe that the procedure Elim-Univ-Variable() does not increase the num- 
ber of constraints. However, Elim-Exist-Variable() has the potential to in- 
crease the number of constraints substantially. Assuming that the Fourier- 
Motzkin elimination algorithm is used, the elimination of k start-time variables, 
could result in the creation of as many as rn^ constraints. One such pathological 
example is provided in [13]. In [5] and [18], it was pointed out that “standard” 
constraints are closed under Fourier-Motzkin elimination i.e. the elimination of 
an existential variable results in the set of constraints staying standard. Using 
the notation, in [18], this corresponds to saying that contracting a vertex of the 
flow graph representing the constraint set, does not destroy the its graph struc- 
ture. Since a graph has at most O(n^) edges at all times, the polynomiality of 
the algorithm for standard constraints follows. 

In our case though, there is no obvious way to either represent the set of 
constraints or bound their number under Fourier-Motzkin elimination, since in 
addition to relative constraints, we also have sum constraints, as discussed in 
Section §3. We make the following observation: 

Observation 51 Network constraints are closed under Fourier-Motzkin elimi- 
nation, using Algorithm (5.1). 

Observation (51) follows from the fact an existential variable i.e. a start time 
variable is eliminated only after the corresponding execution time variable has 
been eliminated. Consequently, its elimination results in a network constraint 
between two other jobs. For instance consider the following constraint set: 

1. S 3 < Si -|- Cl -|- 14; 

2. Si -|- S3 < 22; 

3. S 2 -l- 22 ^ S 3 -|- 63 ; 

4. 636 [3,5]. 

The elimination of results in: 

1. S 3 ^ Si + Cl + 14; 

2. Si T S 3 ^ 22; 

3. S 2 + 19 < S 3 ; 
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The elimination of S3 (by pairing off constraints in which S3 occurs with opposite 
polarity) gives rise to the following set of constraints: 

1. S2 + 19 < Si + ei + 14; 

2. S2 “t“ 19 ^ 22 — si; 

The key point is that the network structure is preserved, under the elimination. 
Observe that if 63 were not eliminated, prior to eliminating S3 the closure claim 
of Observation (51) no longer holds. 

Let Sij denote the set of constraints between the two jobs Ji and Jj) {i < j). 
We now present an informal overview on the nature of constraints I G Sij . Infor- 
mally, a relative constraint between two jobs either specifies increased separation 
between them or decreased separation. For instance, the constraint si -I- 8 < S2 
specifies increased separation, while the constraint S2 < si -I- 17 specifies de- 
creased separation. An aggregate constraint either pushes the jobs to the left or 
to the right. For instance the constraint Si -I- S2 < 7 pushes jos Ji and J2 to 
the left i.e. towards 0, while the constraint S3 -I- S4 > 8 pushes jobs J3 and J4 
towards the right, i.e. towards L. 

To proceed with our analysis, we need the following definitions. We associate 
a type with every constraint, specifying whether it a relative constraint or an 
aggregate constraint. 

Definition 4. Sum (Aggregate) constraint: A network constraint of the form 
a. Si + b.Sj < 0 is said to be a sum constraint if both a and b have the same 
sign. 

For instance, si -I- S2 < 7 and — 3.si — 4.S2 < —9 are sum constraints. 

Definition 5. Difference (Relative) constraint: A network constraint of the form 
a.Si + b.Sj < 0 is said to be a difference constraint, if a and b have opposite signs. 

For instance, the constraint si — S2 < —4 is a difference constraint. 

Definition 6. Constraint orientation (Right): A constraint I G Sij is said to 
have a right orientation if it specifies increased separation between Ji and Jj (in 
case of difference constraints), or pushes both jobs to the right (in case of sum 
constraints) . 

Definition 7. Constraint orientation (Left): A constraint I G Sij is said to have 
a left orientation if it specifies decreased separation between Ji and Jj (in case 
of difference constraints), or it pushes both jobs to the left (in case of sum con- 
straints). 

For instance, the constraint si -I- ei -I- 4 < S2 specifies that Job J2 should 
start at least 4 units after Ji finishes. Since it specifies increased separation, it 
is has a right orientation. Likewise, the constraint si J- S3 < 12 requires that 
Ji and J3 move leftward and hence has a left orientation. Using the flow graph 
terminology in [18], a forward edge in the constraint graph has a right orientation 
and a backward edge has a left orientation. 
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Every constraint I € = 1, . . . ,n has an orientation, on account 

of the total ordering on the job-set. Thus a network constraint between 
job Ji and J5 (say) has the effect of either drawing them together or 
pushing them apart. This is not true, if there is no ordering on the 
job-set. The total ordering on the start time variables implies that all 
these variables have an interpretation on the same real axis [0, L], In the 
absence of the total order, each variable has to be interpreted on its own 
axis. Also see [6]. 



Definition 8 . Comparable constraints: Two constraints h,l2 G Sij are said to 
be comparable if they have the same orientation and type. 

Note that only constraints between the same set of jobs are comparable, i.e. 
a constraint I G S'13 and a constraint /' G S 12 are not comparable, regardless of 
their orientation and type. 

Constraint comparability is an equivalence relation partitioning the set of 
constraints between two jobs Sij into the following four categories: 

1. Difference Constraint with left orientation {S}j)] 

2. Difference Constraint with right orientation {Sfj). 

3. Sum Constraint with left orientation 

4. Sum Constraint with right orientation {Sij)] 



Definition 9 . Constraint domination: A constraint l\ is said to dominate an- 
other constraint I2 if and only if they are comparable and l\ I2 i.e. I2 is 
satisfied, whenever l\ is satisfied (l\,l2 G Sij). 

In some sense, the domination relationship attempts to identify constraints which 
are redundant. For instance, s\ — S 2 < ~4 is clearly dominated by si — S2 < —8, 
since if the latter constraint is satisfied, the former is trivially met. The inter- 
esting case is the comparison between constraints in which there exist execution 
time variables. Consider the two comparable constraints: 

1. : Si — S2 < —4 

2. I 2 '■ s\ — S2 "S ~6i; Cl G [3, 5]. 

Observe that I2 still dominates l\] the parametric schedulability query is: 
dsiVci G [ 3 , 5 ] 3 s 2 . . . A.[s, e] < 6 ? 

Since the query is true for all values of e\ in the range [3,5], the constraint 
Si — S2 < —4 is subsumed by the constraint Si — S2 < — Ci; Ci G [3, 5]. This holds 
true for every pair of comparable constraints l\ and I2, i.e. either li I2 or 
I2 ^ h - In other words, the domination relationship imposes a total order on each 
set of comparable constraints between two jobs. We use lexicographical ordering 
to break ties. It follows that in each equivalence class of the partition imposed 
by the comparability relationship there is a unique constraint that dominates all 
other constraints in that class [8]. 
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Definition 10 . The unique elements G S^p k = 1 , . . . , 4 , which dominate all 
the other constraints in their respective partitions are called the dominators of 
that partition. 



Lemma 1 . If a constraint l\ dominates another constraint I2, then eliminating 
I2 from the set of constraints, does not alter the parametric schedulability of the 
system i.e. if the constraint system has a parametric schedule with I2 then it has 
a parametric schedule without I2 and vice versa. 

In other words, we can eliminate I 2 from the constraint set and test for 
parametric schedulability on the reduced set of constraints. In fact, it suffices to 
retain the 4 dominators between each pair of jobs, since all other constraints are 
redundant. 

Proof : We provide a proof for the case in which both li and I2 belong to the 
set S'b. The other three cases, viz. I\,l2 G Sfpli,l2 G Sfp and l\,l2 G Sfj can be 
proved in identical fashion. 

1. Let the initial system have a parametric schedule - We need to prove that the 
system will continue to be parametrically schedulable, even after I2 is elimi- 
nated. But this is obvious, since we are reducing the number of constraints! 
(If Ai is the feasible region, before the removal of I2 and A2 is the feasible 
region after the removal ofl2, then A\ C A2) 

2. Let the initial system be parametrically unschedulable - We need to show that 

the removal 0/^2 does not make the system schedulable. From our assumption 
I2 G S}j, i.e. I2 is a difference constraint that specifies decreased separation 
between the jobs Ji and Jj. Since li dominates I2, the separation specified 
by h is clearly smaller than the separation specified by I2. Let us assume 
the contrary and suppose that the constraint system A' resulting from the 
elimination of I2 is parametrically schedulable. This means that there do not 
exist start times Si, Sj and execution times Ci which could depend upon Si and 
Cj which could depend upon both Si and Sj, such that a negative cost loop is 
created. (The system is infeasible if and only if there is such a negative cost 
loop [3,15,17].) Essentially, we are saying that ...Vsidcj G [li,Ui]\/sj3ej G 
[lj,Uj]...^({A.[s,e] < b} — {I2}) is false. This means that the constraints 
imposed by the sets can co-exist with li.Now consider what happens 

when I2 is added to A'. I2 cannot decrease the separation between jobs Ji and 
J2 any more than what is specified by li. Consequently if a negative loop could 
not be created using l\, then such a loop definitely cannot be created using 
I2, i.e. the constraint system stays parametrically schedulable, contradicting 
the assumption that it was not. 

□ 

This leads us directly to: 

Lemma 2 . There are at most 4 non-redundant constraints between any 2 jobs; 
hence the total number of non-redundant constraints is at most 0{n^). 
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It follows that Elim-Exist-Variable() takes at most 0 { n ^) time, since each 
start-time variable is part of at most 0(n) relationships. Prune-Constraints() 
basically performs the equivalent of finding the 4 maxima between each pair of 
start-times and hence the total time taken is proportional to the number of 
edges, which is O(n^). Checking the consistency of the resulting constraints can 
likewise be carried out in 0{n?) time. 

Since all the above functions are called at most 0{n) times, the above analysis 
leads to the following conclusion. 

Theorem 1. Algorithm (5.1) can he implemented to run in 0{n^) worst-case 
time. 

Proof : Follows from the discussion above. □ 
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Abstract. The issue of knowledge sharing has been an important topic 
in multi-agent research. Knowledge sharing leads to that agents ana- 
lyze, judge and synthesize the told information so as to make agents’ 
own knowledge. To match these applications, this paper builds a logical 
framework for knowledge sharing among agents. We develop a multi- 
modal logic for reasoning about both agents’ knowledge and told in- 
formation. For formalizing the relationship between knowledge and told 
information, we present a framework of semantics, with respect to which 
a sound and complete proof theory is given. 

Keywords; logic, autonomous agents, philosophical foundations 

1 Introduction 

Currently, the research area of multi-agent systems is an active branch in artifi- 
cial intelligence. In 1988, Bond and Gasser [1] defined a multi-agent system as 
a loosely coupled network of autonomous entities called agents which have indi- 
vidual capabilities, knowledge and resources, and which interact to share their 
knowledge and resources, and to solve problems being beyond their individual 
capabilities. From this definition, we can see that the issue of knowledge shar- 
ing is an important topic in multi-agent research. Therefore, many contributions 
have been made to this topic in the area of multi-agent systems, such as the 
knowledge sharing effort [14] and Knowledge Interchange Format (KIF) [15]. 

However, the issue of knowledge sharing among agents that usually have 
peculiarities of their knowledge and reasoning properties in the viewpoints of 
one agent has not been handled yet. We can use the following example to make 
clear what we exactly mean here. An agent i may believe that another agent j’s 
knowledge or belief is veridical^ z.e., in terms of standard multi-modal language, 
Ki{Kja ^ a), for all formulas a. Only in this case, agent i is willing to inherit 
all knowledge of agent j and is ready to accept told information from agent j. 
So, it is important and interesting to present a reasoning model which enables 
agents to explicitly express, in their knowledge, both what they have been told 
and some peculiarities of their colleagues’ knowledge and reasoning properties. 
In this paper, we will tackle this issue. 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 561-570, 2001. 
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On the other hand, although there has been a flourishing of epistemic logics 
formalizing multi-agent knowledge, belief, desire and intention, by using various 
kinds of possible-worlds semantics, despite their varieties of semantics, these 
logics usually assume a fixed epistemic logic for a certain class of modalities, 
and therefore all agents have the same reasoning properties. This is perhaps 
because the philosophical community, where various epistemic logics were firstly 
proposed, attacked and defended, has tended to concentrate on the single agent 
case. However, the case our formalism handles is about multiple heterogeneous 
agents. 

In addition, the problem tackled in this paper is also interesting even in the 
case of fixing an epistemic logic or a class of epistemic properties. There is a few 
modal logics combining the notions of an agent’s belief (or knowledge), desire and 
intention; and there even exist general methods of fusing different modal logics 
(see [4]). However, so far there has not been yet one combining the notions of 
an agent’s knowledge and told information, which is thought surprisingly subtle 
whenever more than one agent is in the picture [5]. 

As a first step, we focus on the following epistemic properties: 

• Veridicality. Only true things are known. 

• Introspection. An agent is aware of what it knows and of what it does not 

know. 

• Consistency. An agent’s knowledge is non-contradictory. 

Note that, by considering different combinations of the above properties, the 
well-known modal logics S5, K, T, K45 and KD45, have been constructed 
and widely discussed. 

In this paper, we aim to develop a model for reasoning about not only multi- 
agent knowledge and told information but also some key reasoning properties of 
agent’s knowledge, including the veridicality and the introspection. We allow the 
explicit mention of the introspection and the veridicality of an agent’s reasoning 
about its own knowledge, so that our language has formulas indicating which 
logic of K, T, S5, K45 is used by an agent. 

The rest of this paper is organized as follows. In next section, we review some 
basic concepts and notations of modal logic, which we will use through this pa- 
per. In Section 3, based on the analysis of relationship between knowledge and 
told information, we define the framework of semantics for our logic. Section 4 
addresses the issue of proof theory of our logic. Finally, we discusses the impor- 
tance of our logic framework in Section 5 and summarizes our contributions in 
Section 6. 

2 Preliminaries 

In this section, we recall some basic concepts and notations related to modal 
logics of multi-agents’ knowledge, which we shall use throughout the paper. 

The language of those logics is propositional logic augmented by the modal 
operators Ki, • • • , K„, where Ki(j) can be read “agent i knows (or believes) </>”. 
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We denote this language by For convenience, we define true as an abbre- 
viation for a fixed valid propositional formula, say p V -'p, where p is primitive 
proposition. We abbreviate -'true by false. 

According to [5], semantics of these formulas can be given by means of 
Kripke structure [8], which formalize the intuition behind possible worlds. A 
Kripke structure is a tuple (LF, tt, /C i, • • • , /C„), where W is a, set of worlds, tt 
associates with each world a truth assignment to the primitive propositions, so 
that tt{w){p) G {true, false} for each world w and primitive proposition p, and 
are binary accessibility relations. By convention, ICf^ and are 
used to refer to the JCi relation and the tt function in the Kripke structure M, 
respectively. We omit the superscript M if it is clear from context. Finally, we 
define 

lCi{w) = {w' I {w,w') G ICi}. 

That is, ICi{w) is the set of worlds agent i considers possible in world w. 

A situation is a pair (M, w) consisting of a Kripke structure and a world w in 
M. By using situations, we can inductively give semantics to formulas as follows: 
for primitive propositions p, 

(M, w) ^ p iff TT^{w){p) = true. 

Conjunctions and negations are dealt with in the standard way. Finally, 

(M,w) ^ ICia iff for all w' G ^ a. 

Thus, an agent i knows a if a is true in all situations that the agent considers 
possible. 

Note that the Kripke structure M is fixed in the above inductive interpre- 
tation. However, it is interesting that our interpretation, as given in the next 
section, is not in the case. 

3 Non-standard Interpretation 

This section presents the framework of semantics for our logic. We focus two 
classes of formulas, i.e., those of the form Kif, and those of the form Tif), which 
is read “agent i is told </>” . We use Kripke structures {W,tt,JCi, - ■ ■ , /C„) and the 
formulas of the form Tif) are interpreted by using the accessibility relation /C^ 
as usual, whereas the interpretation of the formulas of the form Ki(j) is given by 
the analysis of relationship between knowledge and told information. 

3.1 Language 

The language we use, denoted by £^(VI), is £„ augmented by one class of 
modalities Ti and two classes of special proposition constants: 1^, and (1 < 
i < n), corresponding to the reasoning properties: the introspection and the 
veridicality, respectively. Thus, the formula 1^ says that agent i is able to respect 
his knowledge; the formula indicates that the agent knows only true things. 
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We note that it is unnecessary for the language to include some other con- 
stants to express the consistency of agent i’s knowledge since we can use the 
formula ~<Kifalse. 

3.2 For Formulas of the Form Tj</> 

Now we indicate how to interpret formulas of £„(VI). Let be the satis- 
faction relation we are going to define. We here use standard Kripke structures 
(W, 7T, /Cl, • • • , /C„) and situations. The key point is that the accessibility relation 
/Ci in Kripke structure is, however, no longer related to agent i’s knowledge; in 
a each situation (M,w), the syntactic counterpart of those situations {M,w'), 
where w' G K-i{vj), is what agent i has been told rather than his knowledge. 
Thus, given a situation (M,w), we define 
(M,w) 1=^ Ti<j) iff for all w' G /Ci(w)(M, w') |=^ 4>. 

According to the above definition, each modality Ti need not satisfy veridical- 
ity axioms and introspection axioms. Intuitively, the reason for non-introspection 
is clear; we can certainly tell or not tell you something without telling you that 
we have told or not told it. Assuming that the external state of agent i’s remains 
unchanged since the agent was told, it seems that the veridicality depends only 
on the honest of the teller. However, we argue that even if the teller is honest, 
the veridicality of modality Tj is not necessarily true for i-subjective sentences 
since the subjective state of agent i described by i-subjective sentences can be 
changed after agent i is told something. For example, once agent i is told “p is 
true but you do not know it”, this sentence does not hold any longer, because 
agent i’s being told changes agent i’s subjective state. 

3.3 For Formulas of the Form Ki4> 

The relationship between agents’ knowledge and the information that agents 
have been told has received a great deal of attention [11], and is thought of 
surprisingly subtle whenever more than one agent is considered in the picture 
[5]. Before interpreting the formulas of the form Kicj), we give some informal 
remarks on the relationship. Generally, the told information may be subject to 
noise; and thus, in order to obtain their own knowledge, agents need to refine 
the told information via various kinds of refining processes. Of these refining 
processes, we focus on the following three cases for the purpose of this paper: 

• If agent i is veridical, then the agent has the refining process of deleting the 
false pieces of the told information. Thus, even if agent i is told some fact 
p, agent i does not necessarily know p whenever p is false and agent i is 
veridical. 

• There are, for each agent, some special pieces of information, usually con- 
cerning the agent himself, that are straightforward accessible just by his/her 
feeling or observation rather than by his/her or other agents’ reasoning. 
Agents believe or refute such pieces of information firmly, no matter what 
has been told to them. For example, if you feel cold, then you know it, and 
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you still know it after you are told that you feel warm. The introspection 
property here is thought of as such information; if agent i is able to intro- 
spect, then agent i must know it, no matter what has been told to him. 

• If agent i is introspective, then the agent has the refining process of intro- 
spection. This process is surprisingly dependent on the veridicality of agent 
i, which leads the relationship between the told information and the agent’s 
knowledge somewhat subtle. In the absence of the veridicality, the introspec- 
tion of agent i means that if agent i is (resp. is not) told something, then the 
agent must know he/she is (resp. is not) told it. In the presence of veridical- 
ity, agent i pays his/her attention only to true things. The introspection of 
agent i thus indicates that (a) if agent i is told a proposition p which is true, 
then agent i knows he/she is told the fact, and (b) if a proposition p is false, 
then agent i knows that he/she is not told p even though he is indeed told 
p. Nevertheless, in both the cases, we have that if agent i knows something, 
then he/she knows he/she knows it, and if he/she does not know it, then 
he/she knows he/she does not. Therefore, the meaning of introspection we 
use here is consistent with that in the literature. 

Now let us consider the interpretation of the formulas of form Kia. It is cru- 
cial to ask what situations agent i should think possible from the standpoint of 
what he/she actually knows after checking and cogitating on the told informa- 
tion. Let Si{M, w) be the set of all such possible situations for the given situation 
(M,w). Then, we interpret formulas of the form Kia in this way: 

(M,w) 1=^ Kia iff for all € Si(M,w), (M',w') |=^ a. 

We now figure out what Si{M,w) should be. There are four cases according to 
the logical style of agent i in the situation (M,w). Firstly, if the logical style is 
K, i.e. the value of tt{w) is false at both 1^ and V^, then agent i is unable to 
distinguish his/her knowledge from what has been told, and hence the situations 
in Si{M,w) are exactly those {M,w')’s, where w' £ tCi{w). 

Secondly, assume the logical style is T, i.e. 

7r(t(;)(Ii) = false and 7r(t(;)(Vi) = true, 

then, we get Si{M,w) by adding the actual world w to each where 

w' € tCi{w). This enables agent i to delete the false pieces of the told information. 
Thirdly, suppose the logical style is K45, i.e. 

7r(w)(Ii) = true and 7r(w)(Vi) = false. 

Assuming agent i thinks situations possible from his own knowledge, 

we have, by the introspection of agent i, that agent i’s told information in each 
of those situations is exactly that in the actual situation {M,w), which 

is semantically represented as 



JCf{w')=JC^{w). 
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On the other hand, as we mentioned above, the introspection property li is such 
information that if it holds, then agent i must know it, no matter what agent i 
has been told. Thus, in each of those situations li must hold, hence 

TT^ {w'){li) = true. 

Based on the above discussions, we define Si{M, w) as the set of those situations 
where w' G and M' coincides with M but 

ICf^ {w') = ICf^{w) and (w')(Ii) = true. 

For convenience, we denote the above M' Kripke structures by 

[/Ci(rc)’ true 

Finally, let the logical style be S5. Then, by considering the veridicality, we 
should put the actual world w into the set and think that ICf^{w)U{w} 

were the set of worlds possible from the standpoint of agent Fs told information. 
By considering the introspection, we define Si (M, w) in the same way as in the 
case of K45, nevertheless JCf^{w), the set of worlds possible from the standpoint 
of agent i’s told information, is replaced by ICf^ (w) U {ru}. In other words, we 
define Si{M, w) as the set of those situations (M', w'), where w' G ICf^ {w)U{w}, 
and M' coincides with M but 

ICf^ (w') = /Cj^(w) U {re} and tt^ (rt;')(Ii) = true. 

For convenience, we denote the M' Kripke structures by 

^ r_Mwo_. 7 t(w')(i,) ~ 

_/Ci(w) U {w} ’ true 

To summarize. 



{{M ,w')\w' 

if 7 r('u;)(Ii) = false and 7 r(w)(Vj) = false; 
{{M ,w')\w' &lCf{w)}\J{{M,w)}, 

if 7 r('u;)(Ii) = false and 7 r(w)(Vj) = true; 

{(^ [^; I G HI, 

if 7r(rc)(Ii) = true and 7r(rc)(Vi) = false; 

IcsSra’ I G*'(») u {»}), 

if 7r(rc)(Ii) = true and 7r(rc)(Vi) = true 



3.4 Satisfaction Relation 

Now we formally present our semantics framework by defining inductively the 
satisfaction relation between a situation and a formula as follows. Firstly, 




A Logical Framework for Knowledge Sharing in Multi-agent Systems 567 



{M,w) P iff Tr{w){p) = true, for primitive propositions p. Secondly, 
{M,w) 1=^ -'a iff not (M,w) |=^ a, and (M,w) |=^ a A /3 iff (M,w) |=^ a 
and (M,w) \=^ f3. Finally, (M,w) Kia iff {M',w') ol for all {M',w') 
G Si{M,w). 

We remark that according to this semantics, a veridical agent takes the infor- 
mation he/she was told and adds the current world to the set of possibilities to 
obtain its knowledge. However, it seems difficult to construct such agents because 
an agent typically does not know which is the current world and it is impossible 
for an agent to perform this operation of adding the current world. Nevertheless, 
this does not imply this semantics is counter-intuitive and suspectable. There 
might be no veridical agent in the real world, but there are agents that do have 
the veridicality properties from some other agents’ viewpoints. For example, if 
agent j thinks that agent i is veridical, then, from agent j’s viewpoint, agent i 
does not yet know the current world, but agent j knows (or believes) that the 
current world is one of agent j’s possible worlds. Thus, if the agent j supposes 
that the current world is w and agent i’s set of possible worlds (corresponding 
to its told information) is W, then agent j thinks that agent i’s knowledge is 
determined by the set W adding the supposed current world w. 

4 Proof Theory 

This section gives the proof theory of our logic. Let K„(T) be the well-known 
system for the modalities Ti instead of Ki. Clearly, the axioms P and K 
and the inference rules in the system Kn(T) are valid with respect to the se- 
mantics presented above. In addition, in the absence of Ki, the system K„(T) 
is also complete with respect this semantics. Furthermore, we capture the fol- 
lowing important properties of the relationship between told information and 
knowledge. 

Theorem 1 Let </> and ip be arbitrary formulas o/£^(VI), and ip i-objectivef 
Then the following formulas are valid, i.e., satisfied in all situations (M,w): 

1. {Ki<p A Ki{4> ^ (fi)) ^ Kip 

2. (“'li A ~'Vj) ^ {Ki<p Ti(p) 

3. (-1, A V,) {K,<P ^ {T,<p A (/>)) 

j. li KiJ-i 

5. (li A ~'Vj) {Kiip Tiip) 

6. (li A Vi) {Kiip ^ T^ip A Ip) 

1. (li A -'Vi) => {{Ti(p KiTi(p) A {-•Ti<p Ki^T^fp)) 

8. (li A Vi) ^ {{{Tip) A(p) ^ KiTip)) A ((-■£*(/) V -^p)) ^ K^-^Tip>))) 

This theorem is easy to check by using routine method; we omit its proof 
here. Let TK{^^ be the system that results from adding the above valid formulas 
for each i to the axioms for the system K„(T). Thus, Theorem 1 implies that 
TKff^ is sound with respect to our semantics. 

^ We define i-objective formulas as Boolean combinations of primitive propositions 
(except li) and formulas of the form Tjp>, j i. 
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Theorem 2 For each formula 4> in (VI) , there is a formula (j)' such that in 
the system TK^^, the two formulas are provably equivalent, but the latter does 
not have any appearance of Ki ’s. 

The detail of this proof is omitted for limited space. 

For convenience, for an arbitrary formula in (/> in Tj(VI), by ^ we denote 
a formula which provably equivalent to (f> and has no appearance of KiS. By the 
above result it follows immediately the following theorem. 

Theorem 3 (Soundness and completeness) For the formulas in £^(VI), 

the system is a sound and complete axiomatization with respect the se- 

mantics above. 

Proof. The proof of the soundness is easy; axioms TKi • • • TKg are valid 
by Theorem 1 and other axioms and inference rules are trivially true. As for 
the completeness, assume that F is an arbitrary set of consistent formulas. We 
must prove that F is satisfied by some Kripke structure. Let the formula set 
F' = {ip\ (/? G T}. By Theorem 2, F is provably equivalent to F' , and it suffices 
to show F' is satisfied by some Kripke structure. But it is well-known that K„(T) 
is complete with respect to general Kripke structures; we therefore conclude that 
F' is satisfied by a Kripke structure. ■ 

We end this section by pointing out that 



(li A Vi) {Ki->Ti<j) {-•Ticj) V -•(/))) 

is valid, and thus the formula K^-^Tip does not necessarily imply ~'Tip even under 
the condition (1^ A V^). It follows that Axiom T doesn’t hold for our extended 
language, which makes this system novel in contrast to usual combinations of 
two classes of modalities (or modal logics). 

This also sheds some light on the appropriateness of S5 as a logic of knowl- 
edge. The introspection axiom of S5 has been attacked at some length in the 
literature of philosophy and artificial intelligence (see, for example, [10]). Our 
research indicates that 

— for veridical and introspective agents, logic S5„ is an appropriate logic of 
knowledge; and 

— the introspection axiom reasonably stands even if there are other interactive 
modalities in the picture. 

Nevertheless, the veridicality axiom, which was not attacked before, does not 
necessarily hold in the presence of other modalities with respect to our semantics. 

5 Discussions 

This section further shows the significance of our logic in the both philosophical 
and AI respects. 
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5.1 General Logic of Knowledge 

Consider the language £„(VI) that consist of those formulas in £^(VI) without 
any appearance of modalities T^. Our semantics can be used to interpret those 
formulas. This yields a general logic of knowledge such that different agents may 
have different reasoning styles. 

The approach to the above purpose seems to be straightforward via multiple 
versions of Kripke structures in which different agents can have different types 
of accessibility relations. However, in such an approach, the reasoning style of a 
fixed agent, from viewpoints of different agents, remains unchanged. Actually, it 
is viewed as common knowledge among all agents concerned. This leads to simple 
classification of agents provided by three properties: veridicality, introspection 
and consistency. Such a classification is not appropriate for most general systems 
of multiple agents. For example, very few agents are of veridicality property in 
every context. 

5.2 Only Knowing 

The notion of only knowing (see [12]) plays an important role in knowledge 
representation, which closely related to formalisms of nonmonotonic reasoning, 
such as default logic [16] and circumscription [13]. Several notions of multi- 
agent only knowing have been developed under the assumption that all agents 
use the same logic [9,6,7]. However, in the context of heterogeneous systems, this 
assumption is no longer appropriate. 

Based on the results in this paper, we will propose a general notion of multi- 
agent only knowing. In our approach, agent i only knowing something results 
from the agent only being told something, which is easier to formalize because 
the logic styles for the modality T^s are fixed K despite the multi-formity of 
logics of agents’ knowledge. This is also intuitively reasonable; after all, the less 
is told, the less is known. 

6 Conclusion 

In this paper, we have developed a model for reasoning about not only agents’ 
knowledge and told information but also some key reasoning properties of agent’s 
knowledge such as the introspection. These reasoning properties are regarded as 
styles of how an agent transforms the rough information that has been told, into 
its own knowledge after checking and cogitating on the former. By using Kripke 
structures abnormally, we have presented a new framework of semantics, and we 
give a sound and complete proof theory with respect to the semantics. 

Future work includes building more realistic models of knowledge and com- 
munication, where more parameters of properties of accessibility relations may 
be taken into account. It is also interesting to consider temporal factors in those 
models. Moreover, it is worth to explore the notion of only knowing derived by 
that of only being told, and compare them with those of previous treatments 
[12,9,6] on the notion of only knowing. 
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Abstract. We propose a lockout avoidance algorithm for the fc-exclusion 
problem on the asynchronous multi- writer/reader shared memory model. 
This algorithm is the extension of an accelerated version of the n-process 
algorithm to the fc-exclusion problem. The running time by the algorithm 
for the trying region of any faultless process is bounded by (n — fc)c -I- 
0{n{n — k)‘^)l if the number of stopping process failures is less than fc, 
where n is the number of processes, I is an upper bound on the time 
between two successive atomic steps for any faultless process, and c is 
an upper bound on the time that any user spends in the critical region. 

key words: asynchronous processes, concurrent computation, fc-exclus- 
ion, lockout avoidance, lockout freedom, shared memory 



1 Introduction 

Mutual exclusion is a problem of managing access to a single indivisible resource 
that can only support one user at a time. The fc-exclusion problem is a natural 
extension of the mutual exclusion problem. In fc-exclusion, some number of pro- 
cesses, specified by the parameter fc, are allowed to be concurrently inside the 
critical region, where corresponding users can use the resource. This extension 
was first defined and solved by Fischer, et al. on the shared memory model [4]. 
Afek, et al.[l] gave another solution to the fc-exclusion problem. Unlike the so- 
lution by Fischer et al. this solution does not use a powerful read-modified-write 
primitive. By the use of a concurrent time-stamp system [2,3], it requires only 
bounded read/ write shared memory [1]. 

The n-process algorithm for the mutual exclusion problem was proposed 
by Peterson [7], and accelerated by Igarashi and Nishitani [5]. The algorithm 
proposed in this paper is an extension of these algorithms to the fc-exclusion 
problem on the asynchronous multi- writer/reader shared memory model. 

The algorithm is immune to stopping failures of fewer than fc processes. All 
logarithms used in this paper are to the base 2. We impose an upper bound I on 
the time between successive atomic steps of each faultless process in the trying 
region and the exit region, and an upper bound c on the time that any user 
spends in its critical region. Let n be the number of processes in a system. The 
running time by the algorithm is bounded by (n — fc)c -I- 0(ji(n — k)'^)l. When 
fc = 1 (i.e., the mutual exclusion case), the running time {n— l)c-\- 0{n^)l of the 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 571-575, 2001. 
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algorithm proposed in this paper is better than the running time 0{n^)c+0{n^)l 
of the n-process algorithm by Peterson [5,7] and as fast as the running time of 
the accelerated versions by Igarashi and Nishitani [5]. 

2 Preliminaries 

The asynchronous multi- writer/reader shared memory model consists of pro- 
cesses and shared variables. Interactions between a process and its correspond- 
ing user are by input actions from the user to the process and by output actions 
from the process to the user. All communication among the processes is via the 
shared memory. The model is called an I/O automaton [6]. 

A user with access to the resource is modeled as being in a critical region. 
When a user is not involved in the resource, it is said to be in its remainder 
region. In order to gain admittance to its critical region, a process executes a 
trying protocol. The duration from the start of execution of the trying protocol 
to the entrance of the critical region is called the trying region. After the end 
of the use of the resource by a user, the corresponding process executes an exit 
protocol. The duration of the execution of the exit protocol is called the exit 
region. The /c-exclusion problem is to devise protocols for at most k processes 
to be concurrently in the critical regions. 

We assume that the n processes are numbered 1 , • • • , n. Each process i cor- 
responds to user Ui (1 < t < n). The inputs to process i from user Ui are tryi 
which means a request by user Ui for access to the resource, and exiti which 
means an announcement of the end of the use of the resource by Ui. The output 
from process i to user Ui is criti which means the grant of the resource to Ui, 
and rerrii which tells Ui that it can continue with the rest of its work. 

The following conditions should be satisfied. 

(1) There is no reachable system state in which more than k processes are in 
the critical regions. 

(2) If at least one faultless process is in the trying region and less than k processes 
are in the critical region, then at some later point some process enters its 
critical region. 

(3) If a faultless process is in the exit region, then at some later point the process 
enters its remainder region. 

(4) If all users always return the resource and if the number of faulty processes 
of the stopping type is at most f < k, then any faultless process wishing to 
enter the critical region eventually does so. 

Conditions (1), (2), (3) and (4) above are called respectively fc-exclusion, 
progress for the trying region, progress for the exit region, and fc-lockout avoid- 



ance. 
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3 A Lockout Avoidance Algorithm 

procedure {n,k)-EXCL 
shared variables 

for every s G {1, n — k}: 

turn{s) G {1, initially arbitrary, writable and readable by all 

processes; 

for every t G {1, n}: 

flag{i) G {0, n — k}, initially 0, writable by i and readable by all 

j ^ 

process i 

input actions {inputs to process i from user Ui}: 
tryi, exiU; 

output actions {outputs from process i to user Ui}: 
criti, rerrii] 

** Remainder region ** 
tryi: 

for s = 1 to n — fc do 
begin 

flag{i) := s; 
turn(s) := i; 

waitfor [|{j|j yf i : flag{j) > s}| < n — s — 1] or [turn{s) yf i] 
end; 
criti ; 

** Critical region ** 
exitf. 

flag{i) := 0; 
rerrii; 



For each level s (1 < s < n — fc), the statement waitfor[|{j| j yf i : flag{j) > 
s}| < n— s— 1] or [turn{s) yf i] in {n,k)-EXCL is not an atomic step. It consists of 
0{n) atomic steps. For each s, the contents of the local variable counti does not 
necessarily represent the number of other processes that are located at level s or 
above. Since the shared memory used in this paper is not the read-modified-write 
model, this uncertainty cannot be avoided. 

4 Correctness of (n, k)-EXCL 

We may assume that any process is initially in the remainder region. Throughout 
this paper, we assume that the number of stopping process failures is always at 
most fc — 1. In an execution by (n, k)-EXCL, process i is said to be a winner at 
level s if it has left the waitfor statement in the sth loop of the for statement. 
Note that if a process is a winner at level s then for any 1 < t < s, the process 
is also a winner at level t. For each t (1 < t < n), when process i has entered 
its exit region, the qualification for the winner of process i is canceled. We omit 
the proof of the following lemma due to the space limitations. 
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Lemma 1. In any reachable system state of{n, k)-EXCL, for any 1 < s < n—k 
there are at most n — s winners at level s. 

The next theorem follows immediately from Lemma 1. 

Theorem 1. {n,k)-EXCL guarantees k-exclusion even if any number of process 
failures of the stopping type exist. 

It is obvious that (n, k)-EXCL guarantees the progress for the exit region 
In order to prove the progress for the trying region, and /c-lockout avoidance, it 
is enough to give a time bound for the trying region. 

Theorem 2. Suppose that the number of stopping failures of processes is always 
at most k—1. In {n, k)-EXCL, the time from when a faultless process enters its 
trying region until the process enters its critical region is bounded by (n — k)c + 
0{n{n — kY)l. 

Proof. For each s, I < s < n — fc, define E{s) to be the maximum time from 
when a faultless process has entered the competition at level s until it becomes 
a winner at level s. The worst situation for a faultless process at level n — k to 
reach the critical region is the following case. Consider the case where there are 
k + 1 winners at level n — k — 1 and all of them are competitors including the 
process itself at level n—k, and then the process becomes a loser among them 
at level n — k. 

In the worst case, the time to decide k winners at level n — fc is at most 
(2n + 3)L The time duration includes 2 atomic steps for setting flag values and 
setting turn(n — k) at level n—k, the time to test the condition in the waitfor 
statement at level n — k, and local computing time for each faultless process to 
count the number of flag^ffs with values not less than n—k and for checking 
the value of turn{n — k). Since the number of stopping failures of processes is 
at most /c — 1, it is not possible that all the k winners are faulty processes. The 
time from the end of the competition at level n — k until the time when at least 
one faultless winner resets its flag value to 0 in its exit region is at most 3^ + c. 
Then the loser at level n — k becomes a winner at the level within (2n+ 1)1 time 
after the reset of the winner’s flag value. Hence, F{n — k) < (4n + 7)1 + c. 

The worst situation for a faultless process at level s, 1 < s < n — k — 1, 
to become a winner at level s is the case where n — s + 1 processes have just 
entered the competition at level s, or some of the processes have stopped due 
to their stopping failures after they had become winners at level s, whereby the 
faultless process becomes a loser at level s. In this case, the time required to 
decide the winners at level s is also at most (2n + 3)1. The faultless loser at 
level s will become a winner when a new process joins the competition at level 
s and then sets turn{s) to be its process name, or when at least one winner 
at the level eventually resets its flag value to 0 in its exit region. In the worst 
case, it is sufficient to consider the latter case. In order that the loser at level 
s becomes a winner at that level, it must wait until at least one winner at 
level s exits from the critical region. Since the number of faulty processes is at 
most k—1, there exists a faultless winner at that level. The quickest winner 
at level s can reach the critical region within (n — k — s)(2n + 3)1 time. Hence, 
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the loser at level s can move up as a winner at level s to level s + 1 within 
F{s) < {n — k — s+l){2n + 3)l+{2n+l)l + 3l + c = {n — k — s + 2)(2n + 3)l + l + c. 
Thus the time from when a faultless process has entered its trying region until 
it enters its critical region is bounded by 

n—k n—k—1 

E \{ji — k — s 2)(2n 3)/ 1 c)] -t- (dn -t- 7)/ -t- c 

S=1 S=1 

< (n — k)c + 0{n{n — kY)l. 

□ 



The next theorem fallows immediately from Theorem 2 and Theorem 3. 

Theorem 3. {n,k)-EXCL solves the k-exclusion problem, and it is k-lockout 
avoidance. 

5 Concluding Remarks 

The total size of the shared memory for (n, k)-EXCL is {n—k) [logn] +n|"log(n— 
/c + 1)] bits. We are interested in a problem of whether some modification of 
these algorithms can reduce the shared memory size. We are also interested in 
a problem of whether we can speed up our algorithm by simple modification for 
the case where both k and n — k are 0{n) or for the case where fc or n — fc is 
0(1). The ^-assignment problem is closely related to the fc-exclusion problem. 
We are tempted to solve the fc-assignment problem without using concurrent 
time-stamps by similar techniques. 
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Abstract. We study prefix-free presentations of computably enumer- 
able reals. In [2], Calude et. al. proved that a real a is c.e. if and only if 
there is an infinite, computably enumerable prefix-free set V such that 
a = Following Downey and LaForte [5], we call V a prefix- 

free presentation of a. Each computably enumerable real has a com- 
putable presentation. Say that a c.e. real a is simple if each presentation 
of a is computable. Downey and LaForte [5] proved that simple reals 
locate on every jump class. In this paper, we prove that there is a non- 
computable c.e. degree bounding no noncomputable simple reals. Thus, 
simple reals are not dense in the structure of computably enumerable 
degrees. 



1 Introduction 

This paper investigates the relationship between the c.e. degrees (the compu- 
tational complexity classes under Turing reducibility of sets of positive integers 
which can be effectively enumerated) and the presentations of computably enu- 
merable reals. Investigation on effectiveness of reals originated in Turing’s paper 
[11], and has now become one of the central topics of computable analysis and 
algorithmic information theory. 

Say that a real a is computable if a is the limit of a computable sequence of 
rationals {r^ : i G lo} and there is a computable algorithm M such that for all n, 
ja— I < 2“". All algebraic numbers, tt, the Euler number e, and all numbers 
commonly used in numerical analysis are computale. Note that M here controls 
the rate of the convergence of {vi : i G co}. Effectivity of M is crucial because 
there are computable sequences of rationals converging (non-effectively) to non- 
computable reals, see Spector [10]. Based on this, Soare [8] defined computably 
enumerable reals as follows: 

Definition 1 We call a real a computably enumerable (c.e., for short), if a is 
the limit of a computable sequence of rationals {ri : i G w}. 
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For simplicity, we only consider reals between 0 and 1. In [8], Soare proved 
that a is c.e. iff the lower (left) Dedekind cut of a is computably enumerable. 
For more equivalent definitions of c.e. reals, we need a technical notion whose 
use is crucial in the investigations of randomness of reals. 

Let U = {0,1} be the binary alphabet, and U* be the set of finite binary 
strings. For cr € U*, let |cr| be the length of ct. A C 17* is prefix-free if for 
any (Ti,(T 2 G A, if cti is a prefix of (T2, then cti = (T 2. Say that C is a Chaitin 
machine (self-delimiting Turing machine) if C is a Turing machine processing 
binary strings such that its program set (domain) PROGc = (cr G S*\C{a) 
halts} is prefix-free. Clearly, PROGc is computably enumerable (c.e. for short). 
Conversely, each prefix-free c.e. set of strings is the domain of some Chaitin 
machine. The program-size complexity of the string a € S* (relatively to G) is 
defined as Hc{<j) = min||?7| : 77 G E*,G{ii) = cr}, where min0 = 00. 

Theorem 1 (Invariance Theorem) There is a Chaitin machine U (called uni- 
versal) such that for every Chaitin machine G, Huicr) < Hc{cr) 0(1). 



Note that PROGu is c.e., but not computable. 

Generally, for any prefix-free languages, A say, converges to a 

real in [0,1], i.e., < 1- This is called the Kraft’s Inequality. We use 

2~^ to denote Thus, by the property of PROGu mentioned above, 

we have that js ^ c.e., but noncomputable, real in the unit interval. As 

usual, we use fiu to denote the real Calude, Hertling, Khoussainov 

and Wang [2], Downey, Hirschfeldt and Nies [4], Kucera and Slaman [6] provide 
extensive investigations of the randomness of flu- 

The following are equivalent definitions of c.e. reals. 



Theorem 2 (Calude et. al. [2]) For any a in the unit interval, the following 
are equivalent: 

(1) a is computably enumerable. 

(2) The left Dedekind cut of a is computably enumerable. 

(3) There is an infinite computably enumerable prefix- free set V C S* such 
that a = 2“^. 

(4) There is a computable prefix-free set V C E* such that a = 2“^. 

(5) There is a computable increasing sequence of rationals with limit a. 
Based on characterization (3), Downey and LaForte [5] proposed the notion 

of presentations as follows: 



Definition 2 (Downey and LaForte [5]) Say that F C F7* is a prefix- free 
presentation of c.e. real a if (1) F is a prefix-free c.e. set; (2) a = 2“^. 

By (4) of Theorem 2, each c.e. real has a computable presentation. Say that 
a c.e. real a. is simple if all presentations of a are computable. Since simple c.e. 
reals resemble computable reals, it is tempting to conjecture that simple reals 
might be related to the known classes of c.e. sets resembling computable sets 
(like the low sets) in the sense of the jump hierarchy. Downey and LaForte [5] 
refuted this by showing that there are simple c.e. reals which are high. 
Theorem 3 (Downey and LaForte [5]) There is a simple real a with a' =1- 
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From the point of view of computability theory, Downey and LaForte’s result 
suggests to asking whether all c.e. degrees contain some simple reals, or whether 
the degrees of simple reals are dense in the class of all c.e. degrees. In this paper, 
we prove that answers to these two questions are both negative. 

Theorem 4 There is a noncomputable c.e. set A such that A bounds no non- 
computable simple reals. That is, for any noncomputable c.e. real a € [0,1], if 
a <T A, then a has a noncomputable presentation. 

Here, a <t A means that there is a (partial) computable functional ^ such 
that a = ^{A), namely, the real 0.^(H; 0)^(H; 1) • • • is a binary expansioanry of 
a. Theorem 4 is proved by induction as follows: 

Theorem 5 There is a noncomputable c.e. set A s.t. for any n > 0, for any 
noncomputable c.e. real a G [1 — ^,1— if ct <t A, then a has a noncom- 

putable presentation. 

The main part of this paper is devoted to the proof of the base step n = 0: 
Theorem 6 There is a noncomputable c.e. set A s.t. for any noncomputable 
c.e. real a G [0, |], if a <t A, then a has a noncomputable presentation. 

The successor step of the proof is: Fix A as in Theorem 5, assume that we 
have Theorem 5 for n. We now consider the case n+1. Fix a G [1— 1— ]> 

a <T T as a noncomputable c.e. real. Then (3 = 2a — 1 G [1—^,1 — 

(3 <T A is also a noncomputable c.e. real. By the induction hypothesis, (3 has 
a noncomputable presentation Vg. Define Va = {Oct : ct G Vg}U{l}. Then Va 
is c.e. and noncomputable. Prefix-freeness of Vq comes from that of Vg directly. 
Also we have 

2-v. = y- 2-1,1 = 2-l»"l + i = i E 2-'"' + ) = + { = «■ 

r/GVa <y^Vj3 

Therefore, Va is a presentation of a. □ 

2 Requirements 

In this section we list the requirements necessary to prove Theorem 6. We will 
construct a c.e. set A satisfying for all i the requirements: 

Vg a ^ Wg, 

TZg [ai = = 2“^* G [0, \] & Ui is c.e. & Fi is prefix-free] [oj is 

computable V (3 noncomputable c.e. Ui){Ui presents ai)]. 

Here we assume that {Wiji^ui is a listing of all c.e. sets of natural numbers, and 
{{ai,<Pi,Fi)}i^i^ is a listing of all triples, {a,(I>,F) such that a is a c.e. real, is 
a partial computable functional, and F is a c.e. set of strings. 

Since TZi is a complicated requirement, we break it into simpler subrequire- 
ments Sij as follows: 

S^,r Ui ^ V,. 

Here is a listing of all c.e. sets of strings in S* . 
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3 Basic Modules for Each Requirement 

We now describe the atomic strategies (modules) for each type of requirement 

'Pi, 'P'iySiJ. 

(3.1) The p-module for Pi 

A P-strategy, p say, trying to figure out a number x such that A(x) yf Wi(x), 
is the standard Friedberg-Muchnik procedure, p runs as follows: 

(1) Choose a big number, x say. 

(2) Wait for <Pi(x) 

(3) Put X into A, and stop. 

p has two finitary outcomes: w and s, where w denotes the fact that p waits at 
(2) forever, and s denotes the fact that p reaches (3) and stop. In both cases, 
A(x) yf Wi(x), and Pi is satisfied. 

(3.2) The r-module for TZi 

Let b® the approximation of at stage s, Fi is a c.e. set of strings 

presenting Ui. W.o.l.g., we assume that for all s, ai[s] < X)<reF 
Define the distance between ai and 'Pf as follows: 

e(i,s) = maxja: < s : (Vy < x)['l>f(y)[s\ J,]}, 
m£(i, s) = maxjO, £(i, t) : t < s}, 
d(i,s) = \ Y.y<i{i,s) ^f{y)[s] ■ 2"^ - q;*[s]|, 
md(i, s) = minjO, d(i, t) : t < s}. 

Say that stage s is t-expansionary if s = 0 or if i(i,s) > max{m(t, s), |cr| : a 
is assigned to some 5^ ^-strategy}, d(i,s) < min{mfi(i, s), 2“l'^l : a is 

assigned to some 5ij-strategy}, and ^ 2“^*'* for all t < s. 

The T-module has a (finitary) outcome 1, in case that there are only finitely 
many t-expansionary stages, and an (infinitary) outcome 0, in case that there 
are infinitely many z-expansionary stages. 

If T-module has an outcome 0, then r is responsible to construct an auxiliary 
c.e. set of strings Ur such that Ur is noncomputable, and that Ur presents 
Ur is constructed by r’s substategies. Links from r to these substrategies will be 
created and removed alternatively in the whole construction. We will also ensure 
that s infinitely often, we can update Ur such that 

Xcrec/^ s 2“l'^l = ar,s- We can do this by Chaitin-Kraft inequality. Thus, we can 
effectively expand the prefix-free set Ur,s to a larger one with the prefix-freeness 
preserved. Since ai G [0, ^], we can always choose the strings for the updating 
process as Ict, and leave the strings of the form 00 • • • 01 for the 5^ ^-strategies, 
j G Lo. This arrangement avoids the interaction between the diagonalization and 
the updating process. 
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(3.3) The ? 7 -module for Sij 

The ry-module is the “gap-cogap” argument, which is the key part of the 
proof. 77-module requires various parameters such as (7(77), r(?7), whose values at 
the end of stage s will be denoted by 17(77) [s],r (77) [s]. 77-module has two objec- 
tives. One is to find some string a witnessing 11^1 yf Vjj, and if this fails, 77 will 
ensure in a definite way that the real (namely, o;T(r;)) is computable. In the 
first case, 77 itself is satisfied. In the second case, TZi is satisfied and no further 
subrequirements j, need to be examined. 

The 77-module consists of the following steps: 

(1) Choose a string cr = 00 • • • 01, with |cr| big. 

(2) Wait for a stage s such that a G Vj[s\. At stage s -I- 1 open a gap by 
setting r[s -|- 1] = 0, and go to step (3). Stop updating Ui until this gap is 
closed. 

(3) Wait for the least t-expansionary stage t>s+l such that £{i,t) > \a\. 
At stage t+ \ close the gap by performing step (3a) or (3b) according to which 
case applies: 

(3a) (Successful close) Suppose that ai[t] — o;i[s] > Enumerate a 

into Ui, update Ui and stop. 

(3b) (Unsuccessful close) Otherwise, define r\t -I- 1] = t (to preserve 

[t] \ |ct|). Cancel cr, update Ui and go to step (1). 

Note that if and Fi is a presentation of Oj, then every gap must 

be eventually closed, and so Ui will be updated correctly. 77-module has three 
possible outcomes: 

w. In this outcome, the 77-module opens at most finitely many gaps and 
never closes one successfully. 

Let s be the last stage at which 77 closes its gap, and a be chosen at stage 
t > s. Then a never goes into Vj, because otherwise, 77 will create another link, 
which is contrary to the choice of s. Thus, Vj(a) = 1 — Vj{a) = 1 ^ 0 = Ui{a) . 
Sij is satisfied. 

s. There is some stage s -I- 1 at which 77 closes a gap successfully. 

In this case, a = cr[s] is enuemrated into Ui at stage s -I- I. Suppose that at 
stage vi < s, a was defined as cr[s], and at stage V2 G [ui,s), 77 opened a gap 
trying to enumerate a into Ui. By the module, ai[s] — ai[u2] > since the 

updating process does not perform during the gap, we are eligible to put a into 
Ui- Thus, C/i(cr) = 1 yf 0 = 1 — Vj(a), Sij is satisfied. 

g. In this outcome, 77 opens infinitely many gaps and closes each one unsuc- 
cessfully. 

Obviously, the limit of the length of the candidates (strings) is 00. Suppose 
that the infinitely many gaps are begun at stages {tn}neui and closed at stages 
{vn}neujj where to < < wi < • • • . The intervals of stages {s : t„ < t < u„} 

are called gaps, since r = 0 in these intervals and we are free to enumerate any 
number into A to make A noncomputable, while the intervals {s : < s < tn+i} 

are called cogaps and we have r = Vn- 

ai is computable because: 
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Given n. Choose a stage t large enough such that a gap is begun at stage 
t + 1, and the corresponding string a has length |cr| > n + 3. Since the gap is 
closed unsuccessfully at stage u-l-l we have ai[v] — Ui\t] < . If the 

next gap is begun at stage t' -I- 1 > w -I- 1, then the A-restraint r ensures that 
r klH = f Thus, 

ai[t'] - ai[v] 

< \ai[t'] - 'Ey<e{i,v) (J/)H • 2"^| + I 'Ey<e{i,v) (y)H ’ 2”^ - ai[v]\ 

< ■ 2-^1 + 1 Ey<m,tn ^fivW] ■ 2 -" 

■ 2 ^l + IEy<t(i,„)^ny)H -2 

= 3-2-l‘^l. 

Therefore, we have 

a*[t'] - <Xi[t] = - ai[v\) + {ai[v\ - ai[t]) 

< 3 . 2-l‘^l 2“l'^l = 

Now the argument continues with a replaced by a new one, cr' say. This estab- 
lishes the following: 

cti - ai[t] < 2-(I‘^I-2) -h -h • • • < 2-(I'"I-3). 

Since |cr| > n + 3, we have at — ai[t] < 2“". Thus, is computable. 

In this case, liminf^ r[s] = 0, because r is defined as 0 during gaps. 
g is an infinitary outcome, while w, s are both Unitary. 



4 The Priority Tree 

Let A = {0, l,s, 5 ,w;} with the order </i in the order (from left to right) as 
listed. T is defined as follows as a subset of the finite sequences of A. 

(4.1) (The priority tree) (1) Define T = {^ G : (Vn)[^(3n) € {0,1}, 
^(3n + 1) G (s, w} & ^(3n -I- 2) G (s, g, w}]}. 

(2) Define [T] = {h € A‘^ : (Vn)[h f n G T]}, the set of all infinite path 
through T. 

Every node t G T with |t|= 0 (mod 3), will be assigned to the 7^-requirement. 
The outcome 1 for r denotes that there are at most finitely many r-expansionary 
stages, so t believes that TZ is satisfied. The outcome 0 denotes that there are 
infinitely many r-expansionary stages, and that further actions to satisfy TZ must 
be taken by nodes rj G> t. 

Every node p G T with \p\ = 1 (mod 3), will be assigned to the P-requirement. 
Every node rj G T with |? 7 | = 2 (mod 3), will be assigned to some subrequire- 
ment Sij. The ry-strategy will be a variant of the basic module described in the 
section above, having outcomes from |s, 5 ,w;}. 

(4.2) Define (partial) functions i,j : T -G u and lists Lq,Li,L 2 , by induction 
on n = 1^1 as follows. 
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n = 0. Define as above, i(A) = 0, and set Lo(A) = ii(A) = L 2 {\) = w. 
n > 0. Let be the immediate predecessor of 

Assume that *(^“), Lo(^“), L2(C~) are defined, and that j{^~) is 

defined if is an 5-strategy. First define Lo(C)>^i(?) and i2(C) as those in 
the next paragraph. If ^ is an 7^-strategy, define i(^) = G Lo(^)], and let 
j(5) be undefined. If ^ is a P-strategy, define i{^) = G Li(C)]> and also 
let j(?) be undefined. If ^ is an 5-strategy, define z(^) = zq, j(C) = Jo where 
(*o,Jo) = G ^2(0]- 

Case 1: |^“| = 0 (mod 3). Adopt the subcase below. 

subcase (i): a = 0. Define Lo(^) = Lq{^~) — {*(C~)}) and 

a2(c) = L2(n- 

subcase (ii): a = 1. Lo(^) = (Lo(?“) - {*(C“)}) U {z' : i' > z(^“)}, Li(^) = 
■bi(C")U{z' : z' > z(C")}, and ^2(6 = {L 2 (i~) - {{i(^~),k) : k G w})U{(z',fc) : 
z' > z(^“) & A: G w}. 

Case |C| = 1 (mod 3). Define Lo(C) = Lo{^~), Li(C) = Ci(^“) - {z(^")}, 
and L 2(0 = L 2 {^~)- 

Case 3: |^| = 2 (mod 3). Adopt the subcase below. 

subcase (i): a = s or a = w. Define Lq{^) = Lo{^~), Li(0 = and 

L2{o = L2in-mn,M-))}- 

subcase (ii): a = g. Define Li{() and C2(C) as in Case 1, subcase (ii). 

This completes the priority assignment. 

5 Construction 

At the end of the construction, we define the true path f G [T] of the construction. 
/ is approximated via <5[s] G T at each stage s, and we will have / = liminfs i5[s]. 
During stage s, we define (5[s] via a series of substages t < s at which we define 
a string 6t such that St C 5t+i and (i[s] is the last 5t defined. Only string 5t is 
allowed to act at substage t. 

(5.1) (Genuine stages) For ^ G T, 

(1) Say that s + 1 is a Cstage if ^ C <5[s +1]. In addition 0 is a Cstage. Let 
5^ be the set of all ^stages. 

(2) Say that s + 1 is a genuine Cstage if ^ = 5* for some substage t of stage 
s + 1, where 6t will be defined during the construction. Let be the set of 
genuine ^stages. 

(5.2) (Expansionary stages) Fix r. At stage s+ 1, define the length functions 
as: 

£{t,s) = max{x < s : (Vy < x)[^^{y)[s] i]}, 
m£{T, s) = maxjO, £{t, t):t<s&t+lG 5"^}. 

Also define the distance function as: 
d{T, s) = I 'Ey<e(r,s) (y)W ■ 2"*^ - ar[s]|, 
md{T, s) = min{0, d{T, t):t<s&t+lG 5"^}. 
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Say that stage v is r-expansionary stage, if v = 0, or if v = s -I- 1 G 
£{t, s) > max{m£(r, s), \<j\ : a is ever assigned to some 5i(,-)^j-strategy}, d(r, s) < 
min{m(i(T, s), : cr is ever assigned to some -strategy}, and 

a,-[s] > 2 “-^^'* for alH < s,t G S'^ . 

Suppose that at stage s, an 5-strategy r] opens a gap, then we shall construct 
a link (r, rj) with bottom rj and top r = T^rj). By the construction of T, we have 

(0) C 77 and -F’(r 7 )[s] = g. The link (r, 77 ) remains in force until either 77 closes 
the gap (either a successful or a unsuccessful close), or 77 is initialized. If at stage 
s -I- 1, there is a link (r, 77 ) and there is a substage t of stage s -I- 1 such that 
St = T and <5(+i = 77 , then we say that the link (r, 77 ) is travelled at stage s -I- 1. 
The purpose of the link is to allow the flow of control of the construction to pass 
directly from the top node r to the bottom node 77 at r-expansionary stages, 
to close the gap. The hypothesis Ut = ^t{A) guarantees that each link will be 
travelled (at most once) and the corresponding gap will be closed. 

Note that travelling the link (r, 77 ) may lead the construction to pass over 
some nodes ^,r C ^ C 77 of lower global priority, i.e., i(^) > 7 ( 77 ) = i(t), which 
would otherwise receive attention before 77 if we simply descended the tree with- 
out using links. For such s-k 1 is a ^-stage, but not a genuine ^-stage. This can 
result in the failure of Thus, to consider whether a given ^-strategy succeeds 
or not, we must examine the genuine ^-stages rather than merely the ^-stages. 
(5.3) (Construction) 

The construction is as follows: 

Stage s = 0. Initialize all nodes ^ G T. Deflne <5[0] = A. 

Stage s-k 1. The construction will proceed by substages t < s-k 1. We refer to 
substage t of stage s-k 1 as stage (s-k 1, t). The value of parameter p at the end of 
substage t will be denoted by pt, rather that the complete speciflcation pt[s-k 1 ]. 
At the end of substage t we shall have St £ T such that St C C 5[s -k 1], 
for t < s, and only St may act during substage t. At the end of stage s -k 1, if 
s -k 1 is a T-expansionary stage, and there is no link with top t created at stage 
s -k 1, where t is an 7?.-strategy, then perform the updating process for t by 
enumerating into Ur approriate strings to make = oir,s+i- This 

process keeps the resulted c.e. set C/r,s+i preflx-free. 

Substage t = 0. Deflne do = A. Go to substage 1. 

Substage t+1 < s-kl. Given St and for all 5-stretgies rj £ T, Ft{g) G {s, g, tu}, 
the current state of the 77 -module, deflne St+i as follows: 

Case 1. St is an 7?.-strategy. If s -k 1 is not d(-expansionary, then deflne St+i = 
St '' (1). If s -k 1 is dt-expansionary, and there is no link with top St, then deflne 
di_|_i = St ^ (0). Otherwise, s -k 1 is d^-expansionary, and there is a link with top 
St and bottom rj ^ St'^ ( 0 ), deflne St+i = rj- 

Case 2. St is a P-strategy. If St is satisfled, deflne dt+i = St ^ (0). Otherwise, 
deflne dt+i = d* ^ (I). 

Case 3. St is an 5-strategy. Deflne dt+i = St^ (Ft{St)). 

Let ^ = di_|_i. Say that ^ requires attention at substage (s -k 1, t + 1) if ^ has 
not been satisfled, and: 
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1. ^ is a P-strategy, and one of the following holds: 

(la) x{^) is not defined, or 

(lb) <?{(a;(0)[s] i- 

2. is an 5-strategy, and one of the following holds: 

(2a) (j(^) is not defined, or 

(2b) ^ is ready to open a gap, or 
(2c) ^ is ready to close a gap. 

If ^ does not require attention, and t < s, then go to substage t-|-2. Otherwise, 
choose the first one which applies and perform the action as follows. If t < s, and 
^ opens an ^-gap, then go to substage t -|- 2. Otherwise, define <5[s -f 1] = 5t+i, 
go to stage s -I- 2. 

Case 1: ^ = p is a P-strategy. If (la) happens, then define x{p) as a big 

number. If (lb) happens, then enumerate x{p) into A and say that Vp is 
satisfied. In any case, initialize all C > P- 
Case 2: ^ = ?7 is an 5-strategy. 

If (2a) happens, define a{r]) = 00- ••01 with |cr(? 7 )| big. (Clearly, {(j{ri)} U 
Ut(p),s is prefix-free.) Initialize all C > (w). 

If (2b) happens {This action is called step 1), open an p-gap by defining 
F{rj) = g, and r{rj) = 0. Initialize all C > p ^ (w). Create a link (t, p) with 
top r = T{rj), and bottom rj. 

If (2c) happens {This action is called step 2), let w -I- 1 < s -I- 1 be the stage 
at which the current p-gap was opened, and let a = cr(? 7 )[s]. Close the p-gap 
as follows: 

step 2a {Successful Close) If ap[s] — ap[v] > then enumerate 

a{r]) into and define F{r]) = s. Initialize all C > rj'^ (g). Set r{rj) = 0. 

step 2b {Unsuccessful Close) If a,,[s] — a,,[w] < then define 

F{rj) = w and r(p) = s {to preserve f |cr(? 7 )|), cancel ( 7 ( 77 ), initialize all 

C > 

This completes the construction. 

6 Verification 

Let / be the leftmost path. That is, the leftmost / G [T] such that for all 
strings ^ G T, if ^ <L / and f % f , then there are only finitely many stages 
where f C 5[s]. The following lemmas prove that our construction satisfies all 
requirements. 

(6.1) Lemma Fix n and f C f with |^| = n. Then, 

(1) (3“s)(e C 5,), 

(2) |G«| = oo. 

(3) If 1^1 = I (mod 3), then ^ receives attention only finitely often. 

(4) For any t C if |r| = 0 (mod 3), and r ^ (0) C then every link with 
top T is removed and there are infinitely many genuine ^-stages where there are 
no links with top r. 
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(6.2) Lemma Let p C ^ with \p\ = 1 (mod 3), and let x be a follower of p 
where x = limg a;(p) [s], the last follower of p appointed at stage si say, where 
We, Si n Asi = 0. Then 

(1) (Vs e G'’)[s > Si — >■ y > R{p, s)], where R{p, s) = max{r(C, s) : C <l p}- 

( 2 ) Pi(p) is satisfied. 

(6.3) Lemma Suppose that r C / with |r| = 0 (mod 3) and (1) C /. Then 

(6.4) Lemma Suppose that t] C f with \r]\ = 2 (mod 3). 

(1) If 77 ^ (s) C /, then Ur(ri) H yf 0. Indeed, a{rj) G Ur(n) H T,(p), where 
a{p) is the (unique) string enumerated into Ur{rj) when the (last) 7 ?-gap is closed 
(successfully) . 

(2) If ?7 ^ (g) C /, then is computable. 

(3) If 7 ? ^ (w) C /, then Ur(ij) U P,(p) yf E*. 

(6.5) Lemma Suppose that r C / with |r| = 0 (mod 3) and (0) C /. Then 

Ur is a presentation of Ur- Namely, Ur is prefix-free, and = Ot- 

Moreover, if ar is not computable, then Ur is also not computable. 

This completes the proof of Theorem 6 . □ 
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Abstract. Let C^c and C„c be classes of the semi-computable and 
weakly computable real numbers, respectively, which are discussed by 
Weihrauch and Zheng [12]. In this paper we show that both Csc and 
Ctoc are not closed under the total computable real functions of hnite 
length on some closed interval, although such functions map always a 
semi-computable real numbers to a weakly computable one. On the other 
hand, their closures under general total computable real functions are the 
same and are in fact an algebraic field. This field can also be character- 
ized by the limits of computable sequences of rational numbers with some 
special converging properties. 



1 Introduction 

The computability notion of real numbers is introduced by Alan M. Turing in 
his famous paper [9] on “Turing machine” . He called a real number x computable 
if it has a computable binary expansion. Namely, there is a recursive set A C N 
such that X = XA ■= EieA Equivalently, x is computable iff there 

is a computable sequence (cc„)„gN of rational numbers which converges to x 
effectively in the sense that |a; — a;„| < for all n G N. In fact, as pointed 
by R. M. Robinson [7], corresponding to any classical definition of real numbers 
there is an equivalent definition of computable real numbers. The class of all 
computable real numbers is denoted by Cg. 

Essentially, we can effectively find an upper bound and a lower bound of any 
computable cc G R within any given error. In other words, we can approximate 
the real number x effectively from both sides, below and above. Sometimes, a 
real number can only be approximated effectively from one side instead of two. 
As one of the first such example, Specker [8] has shown that the real number 
xa for a nonrecursive r.e. set A is effectively approximable from below but it is 
not effectively approximable from above. Such kind of real numbers are called 
left computable^. Namely, a real number is left computable if it is a limit of 

* Corresponding author, zheng@informatik.tu-cottbus.de 

^ Some authors call them recursively enumerable or computably enumerable (see, e.g. 

[ 2 ]). 

J. Wang (Ed.): COCOON 2001, LNCS 2108, pp. 586-595, 2001. 

(c) Springer- Verlag Berlin Heidelberg 2001 




Weakly Computable Real Numbers and Total Computable Real Functions 587 



some computable increasing sequence of rational numbers. The right computable 
real numbers can be defined accordingly. Left and right computable real number 
classes are denoted by C/c and C^c, respectively. 

It is not difficult to see that Ce is closed under the arithmetic operations, 
namely, it forms an algebraic field. However, the situation for Cjc and C^c is 
different. The first easy observation is, that x is left computable iff —x is right 
computable, and x is computable iff it is both left and right computable. From 
Specker’s example, it follows immediately that the classes Cjc and Crc are not 
closed under the arithmetic operations, hence they are not algebraic fields. More- 
over, Weihrauch and Zheng [12] have shown that there is a left computable real 
number x and a right computable real number y such that the sum x+y is neither 
left computable nor right computable. Therefore even the union C^c := CicUCrc 
(the class of semi- computable real numbers) is not an algebraic field. In other 
words, the closure of semi-computable real numbers under the arithmetic op- 
erations is a proper superset of the semi-computable real number class. This 
closure is denoted by C^jc and any real number in this closure is called weakly 
computable (w.c., for short). An interesting characterization of w.c. real num- 
bers is (see [12]), that x is weakly computable iff there is a computable sequence 
(x„)„gN of rational numbers which converges to x weakly effectively in the sense 
that 1^" “ Xn+i\ IS finite. 

As the weakest version of effectiveness on real numbers we can consider simply 
the limits of any computable sequence of rational numbers without any further 
restriction but the convergence. We will call a real number x recursively approx- 
imable (r.a., for short) if there is a computable sequence of rational numbers 
which converges to x. The class of all r.a. real numbers is denoted by C^o- As 
shown by Weihrauch and Zheng [12], the class C^jc is a proper subset of C^a 
and Cu,c is an algebraic field^. 

The class C^o shares a lot of mathematical properties with Cg. For example, 
they are all closed under the computable operations (i.e. partial computable real 
functions) and effectively convergent limits ([13]). On the other hand, the class 
Cwc does not share these properties any more, as shown by Zheng [13], although 
it is also an algebraic field. One of the interesting problems remaining open there 
is, whether C^c or Cu,g is closed under the total computable real functions. A 
negative answer to this question will be shown in this paper. 

During the discussion of closure properties for real number classes under 
the total computable real functions, it plays an important role, that whether 
a total computable real function has a finite length on some closed interval^. 
For example, we can easily show that the image f{x) of a semi-computable real 
number x under a computable real function / of finite length on some closed 
interval containing x is always weakly computable. But f{x) can be non- weakly 
computable for general total computable real functions /. Furthermore, we show 
that the class Csc is not closed under the total computable real functions, no 
matter the length of such functions. This holds also for the class In fact 



^ An earlier but weaker result that C^c C Cra is proved by Ceitin in [3] 
® The length of the a function means here the length of its graph. 
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we can even show that C^jc is not closed under the monotone computable total 
real functions. On the other hand the closures of C^jc and Cjc under general 
total computable real functions are in fact the same. This common closure is 
an algebraic field and is strictly between the classes C^c and Cra- At the end 
we show that this field can also be characterized by computable sequences of 
rational numbers with some restriction on their convergence. 



2 Preliminaries 



In this section we will recall some notions and related known results. Basic 
notions about classical computability theory are assumed. Since the real numbers 
can be described by rational numbers, the effectiveness notion on real numbers 
will be naturally introduced by means of the effectiveness notions on rational 
numbers. On rational numbers Q, the computability theory is well developed by 
classical computability theory. For example, the notions of computable function 
/ : Q — >■ Q, recursively enumerable set A C Q, etc. are clear and do not need 
any further explanation. Especially, we will call a sequence (xn)nGN of rational 
numbers computable, if there is a computable function / : N — >■ Q such that 
Xn = f{n). 

In the last section we have mentioned several classes of real numbers. The 
relationship among these classes looks like, as shown in [12], the following 



For C-u!c, we have the following equivalent descriptions. 

Theorem 1 (Weihrauch and Zheng [12]). For a; G M, the following are 
equivalent: 

1. X is weakly computable; 

2. There are left computable real numbers y,z gR. such that x = y — z; 

3. X is in the arithmetic closure ofCic; 

4- There is a computable sequence (a;„)„gN of real numbers which converges to 
X such that ~ Xn+i\ < c for some real number c. 

The computability notion for real numbers can be extended to sequences. A 
sequence (xn)„gN of real numbers is called computable, if there is a computable 
sequence (rnm)n,meN of rational numbers which converges to (cc„)„gN uniformly 
effectively in n and m. Namely, for any n,m G N, jr„m — x„\ < 2“'" holds . 

Now let us recall the definition of computable real functions. There are a 
lot of equivalent approaches to define the computable real functions (see e.g. 
[4,5,6,11]). Here we use the approach of Weihrauch [10,11] by so-called Type- 
2 Turing machines. A type-2 Turing machine M extends the classical Turing 
machine in such a way that it accepts infinite sequences as well as finite strings 
as inputs and outputs. For any input p, M{p) outputs a finite string q if M{p) 
halts in finite steps with q in its write-only output tape, while M{p) outputs 
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an infinite sequence q means that M(p) will never halt and keep writing the 
sequence q on its output tape. A real function^ / :C R — >• K. is computable, if 
there is a type-2 Turing machine M which computes / in the sense that, for any 
X G dom(/) and any sequence p (of rational numbers) which converges effectively 
to X, M{p) outputs a sequence q (of rational numbers) which converges effectively 
to f{x). It is easy to see that computable function is continuous at any point of 
its domain. 

In this paper we are mainly interested in the closure properties of some real 
number classes under the computable total real functions. Obviously, the real 
number x and x±n, for any n G N, have completely the same effectiveness in any 
reasonable sense. Therefore, w.l.o.g., we consider only the real numbers in the 
interval [0; 1] which can always be expressed by xa for some A C N. Similarly, 
we can restrict ourselves to the computable real functions / : [0; 1] — >■ [0; 1] for 
which we have the following equivalent descriptions. 

Theorem 2 (cf. Pour-El and Richards [6], Weihrauch [11]). 

Let / : [0; 1] — >■ R be any function. Then the following are equivalent: 

1. f is computable; 

2. There is a computable sequence {pn)neti of rational polygon functions which 
converges to f uniformly effectively. Namely, \f{x) — p„(x)| < 2“" for any 
X G [0; 1] and n G N. 

3. f is sequentially computable, i.e. f maps any computable sequence (a;n)nsN 

of real number to a computable sequence (/(x„))„gN; and f is effectively 
uniformly continuous, i.e. there is a recursive modulus e : N — >■ N such that 
'ix,y G [0; IjVn G N (|a; - y\ < ^ |/(x) - f{y)\ < 2"”). 

Item 2. of Theorem 2 is called Effective Weierstrass Theorem which implies 
immediately the following lemma where [0; 1 ]q := {r G Q : 0 < r < 1}. 

Lemma 1. Let f : [0; 1] — >■ R be computable. There is a computable function 
/? : Nx [0; 1 ]q — >■ Q such that |/(r) —j3{n, r)| < 2“” for any n G N and r G [0; 1]q. 
In this case we call /3 an effective determinator of function f . 

Let C C R and F a set of functions / :C [0; 1] — >■ R. Denote by F(C) the 
image set of C under the functions of F. Furthermore we define F"(C), for any 
n G N, inductively by F°(C) := C and F”+^(C) := F(F”(C)). The closure 
F*(C) of C under the functions of F is defined now by F*(C) := lj„gp^ F”(C). 
We are especially interested in the classes FIN and TOT of computable total 
real functions from [0; 1] to [0; 1] of finite and arbitrary length, respectively. For 
convenience we will call the function / G TOT (/ G FIN) total computable 
real function (of finite length) without mention of the interval [0; 1]. Since the 
composition of two computable real functions is a computable real function, we 
have TOT*(C) = TOT”(C) = TOT(C) for any n > 1. However FIN(C) = 
FIN*(C) does not hold in general. 

^ We use / :C X Y to denote that / is a partial function from X to Y while 
/ : X — >■ y is a total function from X to Y. 
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Let (Me)eGN be an effective enumeration of all Turing machines and ((/?e)eGN 
an effective enumeration of all computable functions (/?e :C N — >■ N such that (pe 
is computed by Mg. We write ipe{n) i if n € dom((/?g) and (pe{n) f, otherwise. 
The uniformly effective approximation (/?e,s of ipe is defined by (pe,s(n) '■= m, 
if Mf.{n) halts in s steps and outputs m; and is undefined, otherwise. 

Naturally, we assume that e,n, m < s if pe,s{n) = m. For other types of the 
functions, say, / :C N — >■ Q, a similar effective enumeration and corresponding 
approximation can be introduced accordingly. 

For any alphabet S, let E* and be the sets of all finite strings and infinite 
sequences of E, respectively. For u,v G E*, denote by uv the concatenation of 
V after tt. If ru G if* U E‘^, then w[n] denotes its n-th element. Thus, w = 
w[0]t(;[l] • • • w[n — 1], if the length of w, is n, and w = w[0]r(;[l]w[2] • • •, if 
|w| = oo. Obviously, ru[n] is defined only for n < licl. We will say also that 
w[n] is undefined (written w[n] =t), if n > licl. The unique string of length 0 is 
always denoted by A (so-called empty string). For any finite string w G {0; 1}*, 
and number n < the restriction w \ n is defined by (w f n)[i] := w[i] if 
i < n and (w f n)[i\ :=tj otherwise. The generalized restriction w\n is defined 
by w\n := w ( {n + 1), if n < |rc| and w\n := otherwise. For 

M, V G E* U if u = V \ n for some n < |u|, then we call u an initial 
segment of v and denote it by m C v. We denote also uj^v&zuQvhyufZv. 
Let (•, •) : — >■ N be a computable pairing function with computable reverse 

functions. 



3 Technical Lemma 

In this section we will show two technical results which will be used to construct 
some total computable real functions satisfying some special conditions. We 
introduce at first a notion of interval trees. 

Definition 1. Let <5 : N — >■ N+ be a function and I the set of all rational intervals 
on [0; 1] and NJ := {w G N* : Vn < \w\ {w[n\ < <5(n))}. 

1. A 6-interval tree (5-i.t., for short) on [0; 1] is a function / : NJ — >■ I such that 
/(A) = [0; 1]; Ui< 5 (|u;|) Hwi) = I{w), for any w G and lim^^oo ^(/(w*)) = 
0 for any increasing sequence (ws)sgN of NJ in the sense of Ws C Ws+i, where 
l{I{w)) is the length of the interval I{w). For any (5-i.t. / and w G N|, the 

interval I{w) is denoted by I{w) := [a^; 6^] and its length is denoted by 
]5 _ s 

'‘W ■ ^w 

2. A (5-i.t. I is called computable if the functions a, 6 : N| — >■ [0; 1 ]q defined by 
a{w) := and b{w) := b^, respectively, are computable. 

3. A (5-i.t. I is called canonical if, for any w G NJ, the interval I{w) is di- 

vided into <5(|w|) disjunctive subintervals /(wO), /(icl), • • • , I(w((5(|w|) — 1)) 
of the same length, namely, := ' Ojx* and b^ := 

at + rij<|™| Hj)~^ where EiG 0 (’ ’ ■) := 0 and rii 60 (- ’ •) := 1- 
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Notice that, for a canonical <5-i.t. I, the set := {I{wi) : i < i5(|t(;|)} is a 
partition of the interval I{w) and all subintervals in have the same length. 
But, general J-i.t. I, is not necessarily disjunctive. Besides, it is easy to see 
that a canonical (5-i.t. is always computable, if S is computable. We are interested 
mainly in 5-i.t. ’s on [0; 1]. So the phrase “on [0; 1]” is often omitted later. 5-i.t. ’s 
will be very useful to define some total computable real functions which satisfy 
special properties, if these properties can be approximated by some “compatible” 
function l defined as follows. 

Definition 2. Let 5i,52 : N — >■ N"*" and i :C be functions. 

1. L is called (5i, 82) -compatible if the following conditions hold. 

(a) The domain dom(t) of t is infinite and is alternate in the sense that, 
w{i — 1), w(z -b 1) ^ dom(t) for any wi € dom(i) and 0 < f < 5i(|r(;|); 

(b) 'iw,v € {w € dom(6) & w C w =b v € dom(i) & l{v) C t(rc))); 

(c) Vu, u G dom(t) (|m| = |u| =kjr('u)| = jt(u)|). 

2. L is called (5i, 82) -increasing, if for any w G dom(6) and i < j. 

3. L is called effectively modular if there is a recursive function g :C N — >■ N such 
that, wi G dom(t) iff i G dom(g) & t(wi) = i{w)g{i), for any w G dom(t). 



Theorem 3. Let 81,82 ■ N — >■ N“*" be recursive functions, I\ a canonical 8\-i.t., 
and I2 a computable 82-i-t. such that if^ < for some recursive function e 

and all w G If l :C — >■ is a {81, 82) -compatible computable function, 

then there is a computable function f : [0; 1] — >■ [0; 1] such that f{Ii{w)) C 
l2{i-{w)) for all w G dom(t). If l is also {8 1,82) -increasing, then such computable 
function f can even be also increasing. 

Notice that, a function / is increasing here means that x < y =k f{x) < f{y), 
namely non-strictly increase. The computable function / constructed in Theorem 

3 can have finite length if it is increasing. Next Theorem shows that / can have 
also finite length even if it is not monotone. 

Theorem 4. Let 5i := ki and 82 ■= fe be constant functions, I\ and I2 the 
canonical 8i-i.t. and 82-i-t., respectively. 1/2^2 > k\ and l :C — >■ is a 

{81, 82) -compatible and effectively modular function, then there is a computable 
function f : [0; 1] — >■ [0; 1] of finite length such that f{Ii{w)) C for all 

w G dom(t) . 

4 Closure Properties under FIN 

In this section we discuss the closure properties of some real number classes 
under total computable real functions of finite length. We will see that neither 
Csc nor Cwc is closed under FIN. 

Notice that, the computable function / : [0; 1] — >■ [0; 1] defined by f{x) := 
1—x has finite length and maps left computable real numbers to right computable 
ones and vice versa. Furthermore, for any g G FIN, go/ and g have the same 




592 Robert Rettinger et al. 



length. Then it is easy to see that FIN(C/c) = FIN(Crc) = FIN(Csc)- Therefore 
we often state our results only about the class FIN(Csc) and omit the other 
two classes. We show at first that C^c is not closed under FIN and its image 
FIN(Csc) is contained in C^c- 

Theorem 5. C^c C FIN(Csc) ^ Cwc- 

Proof. (Sketch for the first part) We define a computable function / : [0; 1] — >■ 
[0; 1] by applying Theorem 4. Let i5i(n) = 62 ( 71 ) = 7 for any n G N and Ii, J 2 
the canonical i5i- and (^ 2 -interval trees, respectively. Then we have 
for any w G N|^. Define a computable function i :C — >■ by t(A) := A and 

L(wa) := l(w)(6 — a), if a G {1,3,5}, and i.(wa) :=f, otherwise. Obviously, t is 
(<5i, (^ 2 ) -compatible and effectively modular. By Theorem 4, there is a computable 
function / : [0; 1] — >■ [0; 1] of finite length such that f(Ii{w)) C I 2 (l{w)) for any 
w G dom(t). 

Let be an effective enumeration of computable (pi :C N — >■ Q. Then 

H := |(i, n) : (pi(n) } Sz (\/m < n (ipi(m) < (pi(m -1-1)) or \/m < n (ipi(m) > 
(pi(m+ 1)))} is an r.e. set. Especially, there is an effective enumeration function 
(/ : N — >■ iL such that 

g(s) = {i,n) & n yf 0 ^ 3s' < s(g(s') = {i,n- 1)). (1) 

We construct now a computable increasing sequence of rational num- 

bers by Xs ■= for some computable sequence (wg)sgN of dom(/i) which is 
constructed in stages as follows. 

Stage s = 0: Define simply wq ■= “1”. 

Stage s-b 1: Given Ws G dom(/i). Suppose that g(s) = {i,n). Define Wg+i := 
Wgl if (pi(n) ^ l 2 (i-(ws\i)) and, otherwise, define Ws-i-i of length maxji, |ii;s|}-|- 1 

by 



Ws+i[m] 



(ws|"i)[i] -b 2 
(ws|'i)[ij -b4 
(ws \i)[m] 

1 



if m = i Sz ipi(n — 1) > ipi(n) 
if m = i Sz ipi(n — 1) < Lpi(n) 
if m < i 
otherwise. 



(2) 



We can show that (a;s)sGN is a well defined computable increasing sequence of 
rational numbers which converges to such that /(x^^) is not semi-computable. 

Next Theorem shows that the class C„,c is not closed under monotone total 
computable real functions which have, of course, finite length. 

Theorem 6. There is a computable increasing function f : [0; 1] — >■ [0; 1] and a 
weakly computable real number x G [0; 1] such that f(x) is not weakly computable. 
Hence the class C^jc is not closed under FINand Cmc S FIN(Cu;c) 

Proof. We define at first a recursive function 0 : N — >■ N inductively by d(0) := 5 
and 0(n+ 1) := (J2m<n f^(™) 3- 1) • (n-b 1) • 5”+^. Furthermore, let i5i(n) := 9(n) 
and S 2 (n) := 5 be two recursive functions, I\,l 2 the corresponding canonical di- 
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and (^ 2 -interval trees, respectively. Then we have at first. Define a 

computable function l : — >■ with dom(t) = {1,3}* by l{w) := w. Obvi- 

ously, t is also (i5i, (52)-compatible and ((5i, (52)-increasing. By Theorem 3, there 
is an increasing computable function / : [0; 1] — >■ [0; 1] such that f{Ii{w)) C 
/ 2 (t(tu)) = l 2 {w). Now we will construct a computable sequence (xs)ggN of 
tional numbers which converges weakly effectively to some x such that f{x) is not 
weakly computable. Namely, x satisfies, for all t € N, the following requirements: 

OO 

: dom(v3i) = N & ^ Wiin) - ipi{n -b 1)| <i V“n(|/(a;) - ipi(n)\ > 5“*) 

n—0 



where (v3z)ign is an effective enumeration of computable functions :C N — >■ Q. 
Let H := |(i,n) : Vm < n{ipi{m) {)}. Then H is r.e. and hence there is a 
recursive function g : N — >■ N such that rang(g) = H. Especially, we can assume 
that g satisfies condition (1) in the proof of Theorem 5. The sequence (xg)sgN 
is defined by Xg '■= for the computable sequence (ws)sgN of which is 
constructed effectively in stages as follows. 

Stage s = 0: Define wq := “1”. 

Stage s-bl: Given Wg G Let g{s) = {i,n). If Wi{'m)-ipi{m+l)\ > i 

or Lpi{n) ^ l 2 {i-{ws\i)), then define simply Wg+i := Wg and hence a;s+i := Xg. 
Otherwise we define Ws+i of length i -b 1 by 



Wg+i[m] 



1 if m = z & rus [m] = 3 

3 if m = i Sz Wg [m] = 1 

Wg[m] if m < z 

t otherwise. 



(3) 



In this case, we call the stage s -b 1 an z-stage. 

We can show that there are at most i ■ 5* z-stages between stages si and s, 
for any z and si < s such that si is an zi-stage for some zi < z or si = 0 and 
there is no j-stage between si and s for any j < i. Therefore, there are at most 
9{i) z-stages totally. Since \xg — a:s+i| = sequence (xg)seN 

satisfies ~ a;s+i| < 2. Namely, x := lims_>oo exists and is w.c. On 

the other hand, we can also show that Ri is satisfied for any z and hence /(x) is 
not weakly computable. 



Corollary 1. FIN(Csc) C FIN(C^c)- 



5 Closure Properties under TOT 

Different from the case for FIN(Cgc) and FIN(Cu,c) we will show in this section 
that TOT(Csc) and TOT(Cu,c) coincide. Furthermore TOT(Cujc) is an algebraic 
field which is strictly between C^c and Cra- This class can also be described 
by the limits of computable sequences of rational numbers with some kind of 
restriction on their convergence. Since TOT*(C) = TOT(C) for any C C R it 
suffices to mention only the results about TOT(C). 

At first, we introduce a kind of pairing function of real numbers. 
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Definition 3. The pairing function © : — >■ R zs defined, for infinite ^1, B C N 

by 

XA © xb := ^ {5-(2 *+i) : j g |J{3 . 5-(2*+i) ■, i ^ A} 

y|5-(2i+2) . j g 5} y{3 . 5-(2i+2) 



The pairing function © is unfortunately not computable. However its two 
inverse functions are computable as shown in the following lemma. 

Lemma 2. The pairing function © : R^ — >■ R satisfies the following conditions: 

1 . If X and y are left computable, then x (By is also left computable; 

2 . There are computable functions 7ri,7r2 : [0; 1]^ — >■ [0; 1] such that, for any 

x,y e [ 0 ; 1 ], TTi{x ® y) = X, Tr2{x ® y) = y and ® tt2{x) = x . 

Theorem 7. 1 . C^c TOT(C/c); 

2 . TOT(C^c) = TOT(Csc) and TOT(C^c) is an algebraic field. 

Now we show that the closure TOT(Cu,c) does not exhaust the class C^a- 
Theorem 8. C^c Q TOT(Ci„c) Si 

Proof. We sketch only the idea for TOT(Cu,c) = TOT(Cic) $i Cra- Let (a„)„gN, 
(/3n)nsN and (7„)neN be effective enumerations of computable functions a„ :C 
N — >■ N, /3„ :C NxQ — >• Q and 7„ :C N — >■ Q, respectively. For any a G TOT(C;c), 
there is a computable function / : [0; 1] — >■ R and a computable increasing 
sequence (7fc(n))„gN of rational numbers such that b := lim„_>oo 7fe(n.) G [0; 1] 
and f{b) = a. By Lemma 1, / has a computable determinator function ftj : 
N X Q — >• Q. Namely, \j 3 fin.r) — /(r)| < 2“” for any n G N and r G [0; 1 ]q. 
Furthermore, by Theorem 2.3, / has a computable modulus : N — >■ N such 
that Vx, y G [0; 1] Vn G N {\x — y\ < \ f(x) — f{y)\ < 6“”). 

Therefore, it suffices to construct a computable sequence (xg)sgN of rational 
numbers such that its limit x := lims_>oo Xg exists and satisfies, for all e := 
{i,j,k) G N, the following requirements: 

If ai,Pj,jk are total and (7fc(n))„gN converges increasingly to 7^ 
then 3m(|7fc(m) — 7fc| < & |/3j(e, 7fc(m)) —x\> 6“^®+^^). 

To satisfy Re (e := {i,j, k)), we choose a rational interval (a; b) of length 
as a base interval. This interval is divided by a := oq < oi <02 <03 <04 < 
05 < Oq := b into six subintervals for t < 6 of the same length We 

wait for some stage s such that ai{e) := ai,s(e) is defined. Then look for the 
largest m < s such that 7fc,s(0) < 7fc,s(l) < • • • < 7fc,s(w) < 1 are all defined. 
If /3j,s(e, 7fc,s(m)) is also defined and located in the interval R for some t < 6 , 
then define x := ai, if t > 3 and x := 04, otherwise. If there is no such s, then 
define simply x := Oi. If at a later stage Si > s, we find some mi > m such 
that |7fc, sj(m) — 7fe,si(mi)| > 2““de)^ repeat the above procedure for 
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m-i again, and so on. This can happen only finitely often, if 7 fe(n) converges to 
some 7 fe € [0; 1] increasingly. Thus we can always successfully find such x which 
satisfies Re- 

To satisfy all requirements simultaneously, we apply the above strategies for 
Re to an interval I{w) with |ix;| = e for the canonical (5-interval tree I on [0; 1] 
for (5 := 6 by the priority injury technique. The details are omitted here. 

At last we characterize the class TOT(Cu,c) by some convergence property 
of computable sequences of rational numbers. 

Definition 4. Let be any sequence of real numbers and n G N. We 

define the degree of n- divergence of the sequence (oijigN as the maximal natural 
number m which satisfies the condition that there is a chain of natural numbers 
*0 < Jo < *1 < ji < • • • < < jm such that I > for any s < m. 

Theorem 9. The real number x belongs to TOT{Cyjc) iff there is a computable 
sequence of rational numbers which converges to x and a recursive func- 

tion / : N — N such that, for all n € N, a{n) < f{n), where a{n) is the degree 
of n-divergence o/(aj)jgN- 
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Abstract. We stndy Turing computability of the nonlinear solution op- 
erator S of the Cauchy problem for the Schrodinger equation of the form 



.du dru I ,2 

t— = + mu + \u\ u 
dt dx^ 



in K. We prove that S' is a computable operator from (K) to (R)) 

with respect to the canonical representations. 



There is a rich mathematical theory for the Schrodinger equation of the form 
= (—A -I- m)u + \u\^u, t gR, X G R”, 

where A is the Laplace operator in R" and m is a real constant. The equa- 
tion is widely used in several domains of applied physics. For example, it can 
be considered as the classical approximation to the field equation for a quan- 
tum mechanical nonrelativistic many body system with a two body 5-function 
interaction. It is also used in the Landau-Ginsburg theory of superconductivity. 

Our interest is to study computability of the solution operator. It is com- 
monly believed among physicists that for processes which can be described by 
well-established theories, such as quantum systems described by the Schrodinger 
equation, the future behavior can be computed with arbitrary precision at least 
in principle from sufficiently precisely given initial conditions, where the com- 
putations can be performed on digital computers, hence on Turing machines. In 
this note we put the common belief into a formal basis of computable analy- 
sis, which is the theory of real number computation. We study computability of 
the nonlinear Schrodinger propagator within the framework of Type-2 Theory 
of Effectivity (TTE) [WeiOO], which is based on work by Turing, Grzegorczyk, 
Lacombe, Ko, Kreitz and Weihrauch, Pour-El and Richards, and many others 
[Tur36, Grz57, KW85, PER89, Ko91]. For simplicity of notation we consider 
only the case of space dimension n = 1. 

The main result is the following theorem. 
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Theorem 1 The solution operator S : H^(M) — >■ i?^(R)) of the Cauchy 

problem 

= {—A + m)u+ \u\‘^u, t G R, x G R, (1) 

■u(0)(x) = ip€H^(R), (2) 

where S{(p) = u and m is a computable real number, is {Sh^,[p — >■ <5//i])- 
computable. 

We use the framework of TTE[WeiOO]. Let L^(R) denote the set of all 
functions defined on R equipped with L^-norm, iL®(R) the Sobolev space, s > 0, 
C(R, iJ®(R)) the set of all continuous functions g : R — >■ iL'*(R) equipped with 
the compact-open topology, and 5(R) the Schwartz space defined by 

5(R) = {(j)GC^{R) : Vo,/3g N,sup|x“</>(^)(x)| < oo}. 

Let 5^2 and be the standard representations of L^(R) and iL^(R), respec- 
tively, [p — >■ the representation of C(R, iJ®(R)), and 5sc the Cauchy rep- 

resentation of 5(R). For the details about these representations the reader is 
refereed to [WeiOO], [WZ99], and [WZOl]. We need another representation, de- 
noted as 5h^, of iJ®(R) [WZOl]. The representation 5h‘ makes use of the classical 
result that the set of Schwartz functions is dense in iL®(R). 

Lemma 2 The representation 5 h‘ is equivalent to the representation 6 h‘, if 
s G R zs computable. 

Proof: See [WZOl]. 

We now consider the Cauchy problem (1, 2). We use a fixed point argument 
to construct a Type-2 Turing machine which computes a [p — >■ <5^fi]-name of the 
solution when input a i5//i-name of the initial function. The Cauchy problem 
(1,2) can be recasted in the form of the integral equation 

u{t) = U{t — to,'ip) — i f C/(t — T, |M(r)|^M(r))dr (3) 

Jto 

with u{to) = Ip. We will first construct a segment of u on the interval [to ~ 
Tip, to + Tp] by a fixed point iteration, where Tp is a constant depending only on 
the initial condition p of (1, 2). The following two conservation laws are needed. 



Lemma 3 If (R) is a solution of (1), then for any t, t' , to G R, 

< /(llu(io)llffi) (4) 

\\U{t-t',u{t'mH^<f{\\u{to)\\m), (5) 

where f{x) = x • (1 -I- 4-\/2x)^/^ and [/ : R x 5(R) — >■ 5(R) is the free evolution 
defined by U(t, 2 p) := ( 27 r)“^/^ /ju 
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Proof: See [WZOl]. 

We define two maps A and G. The map ^ : R x ff^(R) x C(R, H^(R)) —>■ 
C(R,iJi(R)) is defined by 

= U{t-to,i}) - i f U{t-T,\v{T)\‘^v{T))dT, 

Jto 

and the map G : R x G(R, H^{R)) — >■ G(R, iJ^(R)) is defined by 

G{to,v){t) = -i f U{t - T,\v{T)\‘^v{T))dT. 

Jto 

On a small neighborhood I of to, A{to,4’) is a contraction on G{I,H^{R)), if tp 
is not too big. 

Lemma 4 Let r,p he a rational number such that r,p — 1 < ||v?||ffi < fif, let 
R,p he the least integer upper hound of f{r,p) + 1 and let I := [to — 21^, to + 21^]; 
where T,p = l/(32i?^). Then for any tp € H^(R) such that HV'llffi ^ etnd any 
vi,V2 G G(R;i2^(R)), 

\\Mto,^)vi - A{to,'ip)v2\\i = ||G(to,?;i) - G(to,W2)||/ < ^Iki - ^^2||/ (6) 

where ||w||/ := ||n||c(/,_f/i(R) = sup^gj ||?;(t)||ffi(R). 

Proof: See [GV79]. 

The fixed point of the contraction A{to,tp) is the solution of (3) such that 
u{to) = tp- We are able to show that the restriction of the operator A to 5(R) 
is computable (see Lemma 5 below). So we will approximate tp G iJ^(R) by a 
Cauchy sequence tpk of 5 (R) -functions (see Lemma 2) and use approximations of 
the fixed points of A{to,tpk) to compute the fixed point of A{to,tp) (see Lemma 
6 below). 

Lemma 5 The restriction B of the operator A to S (R) is 
{p, Ssc, [p <5sc], [p 5 sc]) -computable. 



Proof: See [WZOl]. 



Lemma 6 Let ^ R^p ctnd \\tp — tpkWnt < 2~^ for k = 0,1,..., where 

tpk’s are S(R) -functions. Let v.^ he the fixed point of A(fo,tp). Then there are 
computable functions (/i , 32 : N x N — >■ N such that 

\K - ( 0 ) 11 / < 2 -™ 
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Proof: See [WZOl]. 

Combining lemmas 4 - 6 we can construct a Type-Two Turing machine which 

computes u{t) from the initial condition Lp G and arbitrary t G R. 

Proof of Theorem 1: See [WZOl]. 
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